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Alzheimer's Disease Secretase, APP Substrates Therefor, and Uses Therefor 

The present application is a continuation-in-part of United States 
Patent Application 09/416,901, filed October 13, 1999 which claims priority benefit 
5 of United States Provisional Patent Application No. 60/1 55,493, filed September 23, 
1999 and United States Provisional Patent Application 60/169,232, filed December 6, 
1 999. The present application also claims priority benefit as a continuation-in-part of 
United States Patent Application Serial No. 09/404,133 and PCT/US99/20881, both 
filed September 23, 1999, both of which in turn claim priority benefit of United States 
10 Provisional Patent Application No. 60/101,594, filed September 24, 1998. All of 
these priority applications are hereby incorporated by reference in their entirety. 



FIELD OF THE INVENTION 

The present invention relates to Alzheimer's Disease, amyloid protein 
15 precursor, amyloid beta peptide, and human aspartyl proteases, as well as a method 
for the identification of agents that modulate the activity of these polypeptides and 
thereby are candidates to modulate the progression of Alzheimer's disease. 



BACKGROUND OF THE INVENTION 

20 Alzheimer's disease (AD) causes progressive dementia with consequent 

formation of amyloid plaques, neurofibrillary tangles, gliosis and neuronal loss. The 
disease occurs in both genetic and sporadic forms whose clinical course and 
pathological features are quite similar. Three genes have been discovered to date 
which, when mutated, cause an autosomal dominant form of Alzheimer's disease. 

25 These encode the amyloid protein precursor (APP) and two related proteins, 
presenilin-1 (PS1) and presenilin-2 (PS2), which, as their names suggest, are 
structurally and functionally related. Mutations in any of the three proteins have been 
observed to enhance proteolytic processing of APP via an intracellular pathway that 
produces amyloid beta peptide (A0 peptide, or sometimes here as Abeta), a 40-42 

30 amino acid long peptide that is the primary component of amyloid plaque in AD. 
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Dysregulation of intracellular pathways for proteolytic processing may be 
central to the pathophysiology of AD. In the case of plaque formation, mutations in 
APP, PS1 or PS2 consistently alter the proteolytic processing of APP so as to enhance 
formation of Ap 1-42, a form of the Ap peptide which seems to be particularly 
5 amyloidogenic, and thus very important in AD. Different forms of APP range in size 
from 695-770 amino acids, localize to the cell surface, and have a single C-terminal 
transmembrane domain. Examples of specific isotypes of APP which are currently 
known to exist in humans are the 695-amino acid polypeptide described by Kang eL 
ai (1987), Nature 325: 733-736 which is designated as the "normal" APP; the 751 

10 amino acid polypeptide described by Ponte et ai (1988), Nature 331: 525-527 (1988) 
and Tanzi et al (1988), Nature 331: 528-530; and the 770 amino acid polypeptide 
described by Kitaguchi et al (1988), Nature 331: 530-532. The Abeta peptide is 
derived from a region of APP adjacent to and containing a portion of the 
transmembrane domain. Normally, processing of APP at the cc-secretase site cleaves 

15 the midregion of the AP sequence adjacent to the membrane and releases the soluble, 
extracellular domain of APP from the cell surface. This oc-secretase APP processing 
creates soluble APP- a, (sAPPa) which is normal and not thought to contribute to AD. 

Pathological processing of APP at the p- and y-secretase sites, which are 
located N-terminal and C-terminal to the cc-secretase site, respectively, produces a 

20 very different result than processing at the a site. Sequential processing at the P- and 
Y-secretase sites releases the Ap peptide, a peptide possibly very important in AD 
pathogenesis. Processing at the P- and y-secretase sites can occur in both the 
endoplasmic reticulum (in neurons) and in the endosomal/lysosomal pathway after 
reinternalization of cell surface APP (in all cells). Despite intense efforts, for 10 years 

25 or more, to identify the enzymes responsible for processing APP at the p and y sites, 
to produce the AP peptide, those proteases remained unknown until this disclosure. 



30 
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SUMMARY OF THE INVENTION 

Here, for the first time, we report the identification and characterization of the 
P secretase enzyme, termed Aspartyl Protease 2 (Asp2). We disclose some known 
and some novel human aspartic proteases that can act as P-secretase proteases and, for 
5 the first time, we explain the role these proteases have in AD. We describe regions in 
the proteases critical for their unique function and for the first time characterize their 
substrate. This is the first description of expressed isolated purified active protein of 
this type, assays that use the protein, in addition to the identification and creation of 
useful cell lines and inhibitors. We also identify and characterize both a-secretase and 

1 0 P-secretase activities of a protease, designated as Aspl . 

Here we disclose a number of variants of the Asp2 gene and peptide. 
In one aspect,, the invention provides any isolated or purified nucleic acid 
polynucleotide that codes for a protease capable of cleaving the beta (P) secretase 
cleavage site of APP that contains two or more sets of special nucleic acids, where 

15 the special nucleic acids are separated by nucleic acids that code for about 100 to 300 
amino acid positions, where the amino acids in those positions may be any amino 
acids, where the first set of special nucleic acids consists of the nucleic acids that code 
for the peptide DTG, where the first nucleic acid of the first special set of nucleic 
acids is the first special nucleic acid, and where the second set of nucleic acids code 

20 for either the peptide DSG or DTG, where the last nucleic acid of the second set of 
nucleic acids is the last special nucleic acid, with the proviso that the nucleic acids 
disclosed in SEQ ID NO. 1 and SEQ ID NO. 3 are not included. In a preferred 
embodiment, the two sets of special nucleic acids are separated by nucleic acids that 
code for about 125 to 222 amino acid positions, which may be any amino acids. In a 

25 highly preferred embodiment, the two sets of special nucleic acids are separated by 
nucleic acids that code for about 150 to 196, or 150-190, or 150 to 172 amino acid 
positions, which may be any amino acids. In a particular preferred embodiment, the 
two sets are separated by nucleic acids that code for about 1 72 amino acid positions, 
which may be any amino acids. An exemplary nucleic acid polynucleotide comprises 

30 the acid nucleotide sequence in SEQ ID NO. 5. In another particular preferred 
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embodiment, the two sets are separated by nucleic acids that code for about 196 
amino acids. An exemplary polynucleotide comprises the nucleotide sequence in 
SEQ ID NO. 5. In another particular embodiment, the two sets of nucleotides are 
separated by nucleic acids that code for about 190 amino acids. An exemplary 
5 polynucleotide comprises the nucleotide sequence in SEQ ID NO. 1 . Preferably, the 
first nucleic acid of the first special set of amino acids, that is, the first special nucleic 
acid, is operably linked to any codon where the nucleic acids of that codon codes for 
any peptide comprising from 1 to 10,000 amino acid (positions). In one variation, the 
first special nucleic acid is operably linked to nucleic acid polymers that code for any 

10 peptide selected from the group consisting of: any reporter proteins or proteins which 
facilitate purification. For example, the first special nucleic acid is operably linked to 
nucleic acid polymers that code for any peptide selected from the group consisting of: 
immunoglobin-heavy chain, maltose binding protein, glutathione S transferase, Green 
Fluorescent protein, and ubiquitin. In another variation, the last nucleic acid of the 

15 second set of special amino acids, that is, the last special nucleic acid, is operably 

linked to nucleic acid polymers that code for any peptide comprising any amino acids 
from 1 to 10,000 amino acids. In still another variation, the last special nucleic acid is 
operably linked to nucleic acid polymers that code for any peptide selected from the 
group consisting of: any reporter proteins or proteins which facilitate purification. For 

20 example, the last special nucleic acid is operably linked to nucleic acid polymers that 
code for any peptide selected from the group consisting of: immunoglobin-heavy 
chain, maltose binding protein, glutathione S transferase, Green Fluorescent protein, 
and ubiquitin. 

In a related aspect, the invention provides any isolated or purified nucleic acid 
25 polynucleotide that codes for a protease capable of cleaving the beta secretase 

cleavage site of APP that contains two or more sets of special nucleic acids, where the 
special nucleic acids are separated by nucleic acids that code for about 100 to 300 
amino acid positions, where the amino acids in those positions may be any amino 
acids, where the first set of special nucleic acids consists of the nucleic acids that code 
30 for DTG, where the first nucleic acid of the first special set of nucleic acids is the first 
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special nucleic acid, and where the second set of nucleic acids code for either DSG or 
DTG, where the last nucleic acid of the second set of special nucleic acids is the last 
special nucleic acid, where the first special nucleic acid is operably linked to nucleic 
acids that code for any number of amino acids from zero to 81 amino acids and where 
5 each of those codons may code for any amino acid. In a preferred embodiment, the 
first special nucleic acid is operably linked to nucleic acids that code for any number 
of from 64 to 77 amino acids where each codon may code for any amino acid. In a 
particular embodiment, the first special nucleic acid is operably linked to nucleic acids 
that code for 71 amino acids. For example, the first special nucleic acid is operably 

10 linked to 71 amino acids and where the first of those 71 amino acids is the amino acid 
- T. In a preferred embodiment, the polynucleotide comprises a sequence that is at least 
95% identical to a human Aspl or Asp2 sequence as taught herein. In another 
preferred embodiment, the first special nucleic acid is operably linked to nucleic acids 
that code for any number of from 30 to 54 amino acids, or 35 to 47 amino acids, or 

15 40 to 54 amino acids where each codon may code for any amino acid. In a particular 
embodiment, the first special nucleic acid is operably linked to nucleic acids that 
code for 47 amino acids. For example, the first special nucleic acid is operably linked 
to 47 codons where the first those 47 amino acids is the amino acid E. 

In another related aspect, the invention provides for any isolated or purified 

20 nucleic acid polynucleotide that codes for a protease capable of cleaving the beta (P) 
secretase cleavage site of AFP and that contains two or more sets of special nucleic 
acids, where the special nucleic acids are separated by nucleic acids that code for 
about 100 to 300 amino acid positions, where the amino acids in those positions may 
be any amino acids, where the first set of special nucleic acids consists of the nucleic 

25 acids that code for the peptide DTG, where the first nucleic acid of the first special set 
of amino acids is, the first special nucleic acid, and where the second set of special 
nucleic acids code for either the peptide DSG or DTG, where the last nucleic acid of 
the second set of special nucleic acids, the last special nucleic acid, is operably linked 
to nucleic acids that code for any number of codons from 50 to 170 codons. In a 

30 preferred embodiment, the last special nucleic acid is operably linked to nucleic acids 
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comprising from 100 to 170 codons. In a highly preferred embodiment, the last 
special nucleic acid is operably linked to nucleic acids comprising from 142 to 163 
codons. In a particular embodiment, the last special nucleic acid is operably linked to 
nucleic acids comprising about 142 codons, or about 163 codons, or about 170 
5 codons. In a highly preferred embodiment, the polynucleotide comprises a sequence 
that is at least 95% identical to aspartyl-protease encoding sequences taught herein. In 
one variation, the second set of special nucleic acids code for the peptide DSG. In 
another variation, the first set of nucleic acid polynucleotide is operably linked to a 
peptide purification tag. For example, the nucleic acid polynucleotide is operably 

10 linked to a peptide purification tag which is six histidine. In still another variation, 
the first set of special nucleic acids are on one polynucleotide and the second set of 
special nucleic acids are on a second polynucleotide, where both first and second 
polynucleotides have at lease 50 codons. In one embodiment of this type, both of the 
polynucleotides are in the same solution. In a related aspect, the invention provides a 

15 vector which contains a polynucleotide as described above, or a cell or cell line which 
is transformed or transfected with a polynucleotide as described above or with a 
vector containing such a polynucleotide. 

In still another aspect, the invention provides an isolated or purified peptide or 
protein comprising an amino acid polymer that is a protease capable of cleaving the 

20 beta (P) secretase cleavage site of APP that contains two or more sets of special amino 
acids, where the special amino acids are separated by about 100 to 300 amino acid 
positions, where each amino acid position can be any amino acid, where the first set 
of special amino acids consists of the peptide DTG, where the first amino acid of the 
first special set of amino acids is, the first special amino acid, where the second set of 

25 amino acids is selected from the peptide comprising either DSG or DTG, where the 
last amino acid of the second set of special amino acids is the last special amino acid, 
with the proviso that the proteases disclosed in SEQ ID NO. 2 and SEQ ID NO. 4 are 
not included. In preferred embodiments, the two sets of amino acids are separated by 
about 125 to 222 amino acid positions or about 150 to 196 amino acids, or about 150- 

30 190 amino acids, or about 150 to 172 amino acids, where in each position it may be 
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any amino acid. In a particular embodiment, the two sets of amino acids are separated 
by about 1 72 amino acids. For example, the protease has the amino acid sequence 
described in SEQ ID NO 6. In another particular embodiment, the two sets of amino 
acids are separated by about 196 amino acids. For example, the two sets of amino 
5 acids are separated by the same amino acid sequences that separate the same set of 

special amino acids in SEQ ID NO 4. In another particular embodiment, the two sets 
of nucleotides are separated by about 190 amino acids. For example, the two sets of 
nucleotides are separated by the same amino acid sequences that separate the same set 
of special amino acids in SEQ ID NO 2. In one embodiment, the first amino acid of 

10 the first special set of amino acids, that is, the first special amino acid, is operably 
linked to any peptide comprising from 1 to 1 0,000 amino acids. In another 
embodiment, the first special amino acid is operably linked to any peptide selected 
from the group consisting of: any reporter proteins or proteins which facilitate 
purification. In particular embodiments, the first special amino acid is operably linked 

1 5 to any peptide selected from the group consisting of: immunoglobin-heavy chain, 
maltose binding protein, glutathione S transferase, Green Fluorescent protein, and 
ubiquitin. In still another variation, the last amino acid of the second set of special 
amino acids, that is, the last special amino acid, is operably linked to any peptide 
comprising any amino acids from 1 to 10,000 amino acids. By way of nonlimiting 

20 example, the last special amino acid is operably linked any peptide selected from the 
group consisting of any reporter proteins or proteins which facilitate purification. In 
particular embodiments, the last special amino acid is operably linked to any peptide 
selected from the group consisting of: immunoglobin-heavy chain, maltose binding 
protein, glutathione S transferase, Green Fluorescent protein, and ubiquitin. 

25 In a related aspect, the invention provides any isolated or purified peptide or 

protein comprising an amino acid polypeptide that codes for a protease capable of 
cleaving the beta secretase cleavage site of APP that contains two or more sets of 
special amino acids, where the special amino acids are separated by about 100 to 300 
amino acid positions, where each amino acid in each position can be any amino acid, 

30 where the first set of special amino acids consists of the amino acids DTG, where the 
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first amino acid of the first special set of amino acids is, the first special amino acid, 
D, and where the second set of amino acids is either DSG or DTG, where the last 
amino acid of the second set of special amino acids is the last special amino acid, G, 
where the first special amino acid is operably linked to amino acids that code for any 
5 number of amino acids from zero to 81 amino acid positions where in each position it 
may be any amino acid. In a preferred embodiment, the first special amino acid is 
operably linked to a peptide from about 30-77 or about 64 to 77 amino acids positions 
where each amino acid position may be any amino acid. In a particular embodiment, 
the first special amino acid is operably linked to a peptide 35, 47, 71, or 77 amino 

10 acids. In a very particular embodiment, the first special amino acid is operably linked 
to 71 amino acids and the first of those 71 amino acids is the amino acid T. For 
example, the polypeptide comprises a sequence that is at least 95% identical to an 
aspartyl protease sequence as described herein. In another embodiment, the first 
special amino acid is operably linked to any number of from 40 to 54 amino acids 

1 5 (positions) where each amino acid position may be any amino acid. In a particular 
embodiment, the first special amino acid is operably linked to amino acids that code 
for a peptide of 47 amino acids. In a very particular embodiment, the first special 
amino acid is operably linked to a 47 amino acid peptide where the first those 47 
amino acids is the amino acid E. In another particular embodiment, the first special 

20 amino acid is operably linked to the same corresponding peptides from SEQ ID NO. 3 
that are 35, 47, 71, or 77 peptides in length, beginning counting with the amino acids 
on the first special sequence, DTG, towards the N-terminal of SEQ ID NO. 3. In 
another particular embodiment, the polypeptide comprises a sequence that is at least 
95% identical to the same corresponding amino acids in SEQ ID NO. 4, that is, 

25 identical to that portion of the sequences in SEQ ID NO. 4, including all the 

sequences from both the first and or the second special nucleic acids, toward the - 
terminal, through and including 71, 47, 35 amino acids before the first special amino 
acids. For example, the complete polypeptide comprises the peptide of 71 amino 
acids, where the first of the amino acid is T and the second is Q. 
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Ln still another related aspect, the invention provides any isolated or purified 
amino acid polypeptide that is a protease capable of cleaving the beta (P) secretase 
cleavage site of APP that contains two or more sets of special amino acids, where the 
special amino acids are separated by about 100 to 300 amino acid positions, where 
5 each amino acid in each position can be any amino acid, where the first set of special 
amino acids consists of the amino acids that code for DTG, where the first amino acid 
of the first special set of amino acids is, the first special amino acid, D, and where the 
second set of amino acids are either DSG or DTG, where the last amino acid of the 
second set of special amino acids is the last special amino acid, G, which is operably 

10 linked to any number of amino acids from 50 to 170 amino acids, which may be any 
amino acids. In preferred embodiments, the last special amino acid is operably linked 
to a peptide of about 100 to 170 amino acids or about 142-163 amino acids. In 
particular embodiments, the last special amino acid is operably linked to a peptide of 
about 142 amino acids, or about 163 amino acids, or about 170 amino acids. For 

15 example, the polypeptide comprises a sequence that is at least 95% identical (and 
preferably 100% identical) to an aspartyl protease sequence as described herein. In 
one particular embodiment, the second set of special amino acids is comprised of the 
peptide with the amino acid sequence DSG. Optionally, the amino acid polypeptide 
is operably linked to a peptide purification tag, such as purification tag which is six 

20 histidine. In one variation, the first set of special amino acids are on one polypeptide 
and the second set of special amino acids are on a second polypeptide, where both 
first and second polypeptide have at lease 50 amino acids, which may be any amino 
acids. In one embodiment of this type, both of the polypeptides are in the same 
vessel. The invention further includes a process of making any of the polynucleotides, 

25 vectors, or cells described herein; and a process of making any of the polypeptides 
described herein. 

In yet another related aspect, the invention provides a purified polynucleotide 
comprising a nucleotide sequence that encodes a polypeptide having aspartyl protease 
activity, wherein the polypeptide has an amino acid sequence characterized by: (a) a 
30 first tripeptide sequence DTG; (b) a second tripeptide sequence selected from the 
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group consisting of DSG and DTG; and (c) about 100 to 300 amino acids separating 
the first and second tripeptide sequences, wherein the polypeptide cleaves the beta 
secretase cleavage site of amyloid protein precursor. In one embodiment, the 
polypeptide comprises an amino acid sequence depicted in SEQ ID NO: 2 or 4, 
5 whereas in another embodiment, the polypeptide comprises an amino acid sequence 
other than the amino acid sequences set forth in SEQ ED NOs: 2 and 4. Similarly, the 
invention provides a purified polynucleotide comprising a nucleotide sequence that 
encodes a polypeptide that cleaves the beta secretase cleavage site of amyloid protein 
precursor; wherein the polynucleotide includes a strand that hybridizes to one or more 

10 of SEQ ID NOs: 3, 5, and 7 under the following hybridization conditions: 

hybridization overnight at 42°C for 2.5 hours in 6 X SSC/0.1% SDS, followed by 
washing in 1.0 X SSC at 65°C, 0.1% SDS. In one embodiment, the polypeptide 
comprises an amino acid sequence depicted in SEQ ID NO: 2 or 4, whereas in another 
embodiment, the polypeptide comprises an amino acid sequence other than the amino 

1 5 acid sequences set forth in SEQ ID NOs: 2 and 4. Likewise, the invention provides a 
purified polypeptide having aspartyl protease activity, wherein the polypeptide is 
encoded by polynucleotides as described in the preceding sentences. The invention 
also provides a vector or host cell comprising such polynucleotides, and a method of 
making the polypeptides using the vectors or host cells to recombinantly express the 

20 polypeptide. 

In yet another aspect, the invention provides an isolated nucleic acid molecule 
comprising a polynucleotide, said polynucleotide encoding a Hu-Asp polypeptide and 
having a nucleotide sequence at least 95% identical to a sequence selected from the 
group consisting of: 

25 (a) a nucleotide sequence encoding a Hu-Asp polypeptide selected 

from the group consisting of Hu-Aspl, Hu-Asp2(a), and Hu-Asp2(b), wherein said 
Hu-Asp 1, Hu-Asp2(a) and Hu-Asp2(b) polypeptides have the complete amino acid 
sequence of SEQ ID NO. 2, SEQ ID NO. 4, and SEQ ID NO. 6, respectively; and 
(b) a nucleotide sequence complementary to the nucleotide 

30 sequence of (a). 
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Several species are particularly contemplated. For example, the invention 
provides a nucleic acid and molecule wherein said Hu-Asp polypeptide is Hu-Aspl, 
and said polynucleotide molecule of 1(a) comprises the nucleotide sequence of SEQ 
ED NO. 1 ; and a nucleic acid molecule wherein said Hu-Asp polypeptide is 
5 Hu-Asp2(a), and said polynucleotide molecule of 1(a) comprises the nucleotide 
sequence of SEQ ID NO. 4; and a nucleic acid molecule wherein said Hu-Asp 
polypeptide is Hu-Asp2(b), and said polynucleotide molecule of 1(a) comprises the 
nucleotide sequence of SEQ ED NO. 5. In addition to the foregoing, the invention 
provides an isolated nucleic acid molecule comprising polynucleotide which 

1 0 hybridizes under stringent conditions to a polynucleotide having the nucleotide 
sequence in (a) or (b) as described above. 

Additionally, the invention provides a vector comprising a nucleic acid 
molecule as described in the preceding paragraph. In a preferred embodiment, the 
nucleic acid molecule is operably linked to a promoter for the expression of a Hu-Asp 

15 polypeptide. Individual vectors which encode Hu-Aspl, and Hu-Asp2(a), and 

Hu-Asp2(b) are all contemplated. Likewise, the invention contemplates a host cell 
comprising any of the foregoing vectors, as well as a method of obtaining a Hu-Asp 
polypeptide comprising culturing such a host cell and isolating the Hu-Asp 
polypeptide. Host cells of the invention include bacterial cells, such as E. coli, and 

20 eukaryotic cells. Among the eukaryotic cells that are contemplated are insect cells, 

such as sf9 or High 5 cells; and mammalian cells, such as human, rodent, lagomorph, 
and primate. Preferred human cells include HEK293, and IMR-32 cells. Other 
preferred mammalian cells include COS-7, CHO-K1, Neuro-2A, and 3T3 cells. Also 
among the eukaryotic cells that are contemplated are a yeast cell and an avian cell. 

25 In a related aspect, the invention provides an isolated Hu-Aspl polypeptide 

comprising an amino acid sequence at least 95% identical to a sequence comprising 
the amino acid sequence of SEQ ID NO. 2. The invention also provides an isolated 
Hu-Asp2(a) polypeptide comprising an amino acid sequence at least 95% identical to 
a sequence comprising the amino acid sequence of SEQ ID NO. 4. The invention also 

30 provides an isolated Hu-Asp2(a) polypeptide comprising an amino acid sequence at 
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least 95% identical to a sequence comprising the amino acid sequence of SEQ ED NO. 
8. 

In still another aspect, the invention provides an isolated antibody that binds 
specifically to any Hu-Asp polypeptide described herein, especially the polypeptide 
5 described in the preceding paragraphs. 

The invention also provides several assays involving aspartyl protease 
enzymes of the invention. For example, the invention provides 

a method to identify a cell that can be used to screen for inhibitors of P 
secretase activity comprising: 
1 0 (a) identifying a cell that expresses a protease capable of cleaving APP at 

the p secretase site, comprising: 

i) collect the cells or the supernatant from the cells to be 

identified 

ii) measure the production of a critical peptide, where the critical 
15 peptide is selected from the group consisting of either the APP C-terminal peptide or 

soluble APP, 

iii) select the cells which produce the critical peptide. 

In one variation, the cells are collected and the critical peptide is the APP 
C-terminal peptide created as a result of the p secretase cleavage. In another 

20 variation, the supernatant is collected and the critical peptide is soluble APP, where 
the soluble APP has a C-terminus created by P secretase cleavage. In preferred 
embodiments, the cells contain any of the nucleic acids or polypeptides described 
above and the cells are shown to cleave the P-secretase site of any peptide having the 
following peptide structure, P2, PI, Pl \ P2\ where P2 is K or N, where PI is M or 

25 L, where PT is D, where P2' is A. The method where P2 is K and PI is M. The 
method where P2 is N and PI is L. 

In still another aspect, the invention provides novel isoforms of amyloid 
protein precursor (APP) where the last two carboxy terminus amino acids of that 
isoform are both lysine residues. In this context, the term "isofonrr is defined as any 

30 APP polypeptide, including APP variants (including mutations), and APP fragments 
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that exists in humans, such as those described in US 5,766,846, col 7, lines 45-67, 
incorporated into this document by reference, modified as described herein by the 
inclusion of two C-terminal lysine residues. For example, the invention provides a 
polypeptide comprising the isoform known as APP695, modified to include two 
5 lysine residues as its last two carboxy terminus amino acids. An exemplary 

polypeptide comprises the amino acid sequence set forth in SEQ ID NO. 16. The 
invention further includes APP isoform variants as set forth in SEQ ID NOs. 18 and 
20. The invention further includes all polynucleotides that encode an APP protein 
that has been modified to include two C-terminal lysines; as well has any eukaryotic 

10 cell line comprising such nucleic acids or polypeptides. Preferred cell lines include a 
mammalian cell line {e.g., HEK293, Neuro2a). 

Thus, in one embodiment, the invention provides a polypeptide comprising the 
amino acid sequence of a mammalian amyloid protein precursor (APP) or fragment 
thereof containing an APP cleavage site recognizable by a mammalian P-secretase, 

15 and further comprising two lysine residues at the carboxyl terminus of the amino acid 
sequence of the mammalian APP or APP fragment. As taught herein in detail, the 
addition of two additional lysine residues to APP sequences has been found to greatly 
increase Ap processing of the APP in APP processing assays. Thus, the di-lysine 
modified APP reagents of the invention are particularly useful in assays to identify 

20 modulators of Ap production, for use in designing therapeutics for the treatment or 
prevention of Alzheimer's disease. In one embodiment, the polypeptide comprises 
the complete amino acid sequence of a mammalian amyloid protein precursor (APP), 
and further comprises the two lysine residues at the carboxyl terminus of the amino 
acid sequence of the mammalian amyloid protein precursor. In an alternative 

25 embodiment, the polypeptide comprises only a fragment of the APP, the fragment 

containing at least that portion of APP that is cleaved by a mammalian P-secretase (or 
cc-secretase or y-secretase) in the formation of Ap peptides. 

The practice of assays that monitor cleavage of APP can be facilitated by 
attaching a marker to a portion of the APP. Measurment of retained or liberated 

30 marker can be used to quantitate the amount of APP cleavage that occurs in the assay, 
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e.g., in the presence or absence of a putative modulator of cleavage activity. Thus, in 
one preferred embodiment, the polypeptide of the invention further includes a marker. 
For example, the marker comprises a reporter protein amino acid sequence attached to 
the APP amino acid sequence. Exemplary reporter proteins include a fluorescing 
5 protein (e.g., green fluorescing proteins, luciferase) or an enzyme that is used to 

cleave a substrate to produce a colorimetric cleavage product. Also contemplated are 
tag sequences which are commonly used as epitopes for quantitative immunoassays. 

In a preferred embodiment, the di-lysine-modified APP of the invention is a 
human APP. For example, human APP isoforms such as APP695, APP751, and 

10 APP770, modified to include the two lysines, are contemplated. In a preferred 
embodiment, the APP isoform comprises at least one variation selected from the 
group consisting of a Swedish KM-NL mutation and a London V717-F mutation, or 
any other mutation that has been observed in a subpopulation that is particularly prone 
to development of Alzheimer's disease. These mutations are recognized as mutations 

15 that influence APP processing into Ap. In a highly preferred embodiment, the APP 
protein or fragment thereof comprises the APP-Sw P-secretase peptide sequence 
NLDA, which is associated with increased levels of Ap processing and therefore is 
particularly useful in assays relating to Alzheimer's research. More particularly, the 
APP protein or fragment thereof preferably comprises the APP-Sw p-secretase peptide 

20 sequence SEVNLDAEFR (SEQ ID NO: 63). 

In one preferred embodiment, the APP protein or fragment thereof further 
includes an APP transmembrane domain carboxy-terminal to the APP-Sw p-secretase 
peptide sequence. Polypeptides that include the TM domain are particularly useful in 
cell-based APP processing assays. In contrast, embodiments lacking the TM domain 

25 are useful in cell-free assays of APP processing. 

In addition to working with APP from humans and various animal models, 
researchers in the field of Alzheimer's also have construct chimeric APP polypeptides 
which include stretches of amino acids from APP of one species (e.g., humans) fused 
to streches of APP from one or more other species (e.g., rodent). Thus, in another 

30 embodiment of the polypeptide of the invention, the APP protein or fragment thereof 
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comprises a chimeric APP, the chimeric APP including partial APP amino acid 
sequences from at least two species. A chimeric APP that includes amino acid 
sequence of a human APP and a rodent APP is particularly contemplated. 

In a related aspect, the invention provides a polynucleotide comprising a 
5 nucleotide sequence that encodes a polypeptide as described in the preceding 
paragraphs. Such a polynucleotide is useful for recominant expression of the 
polypeptide of the invention for use in APP processing assays. In addition, the 
polynucleotide is useful for transforming into cells to produce recombinant cells that 
express the polypeptide of the invention, which cells are useful in cell-based assays to 

10 identify modulators of APP processing. Thus, in addition to polynucleotides, the 

invention provides a vector comprising such polynucleotides, especially expression 
vectors where the polynucleotide is operably linked to a promoter to promote 
expression of the polypeptide encoded by the polynucleotide in a host cell. The 
invention further provides a host cell transformed or transfected with such a 

1 5 polynucleotide or a vector. Among the preferred host cells are mammalian cells, 
especially human cells. 

In another, related embodiment, the invention provides a polypeptide useful 
for assaying for modulators of P-secretase activity, said polypeptide comprising an 
amino acid sequence of the formula NH 2 -X-Y-Z-KK-COOH; wherein X, Y, and Z 

20 each comprise an amino acid sequence of at least one amino acid; wherein-NH 2 -X 
comprises an amino-terminal amino acid sequence having at least one amino acid 
residue; wherein Y comprises an amino acid sequence of a p-secretase recognition site 
of a mammalian amyloid protein precursor (APP); and wherein Z-KK-COOH 
comprises a carboxy-terminal amino acid sequence ending in two lysine (K) residues. 

25 In one preferred variation, the carboxyl-terminal amino acid sequence Z includes a 
hydrophobic domain that is a transmembrane domain in host cells that express the 
polypeptide. Host cells that express such a polypeptide are particularly useful in 
assays described herein for identifying modulators of APP processing. In another 
preferred variation, the amino-terminal amino acid sequence X includes an amino acid 

30 sequence of a reporter or marker protein, as described above. In still another preferred 
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variation, the p-secretase recognition site Y comprises the human APP-Sw p-secretase 
peptide sequence NLDA. It will be apparent that these preferred variations are not 
mutually exclusive of each other — they may be combined in a single polypeptide. 
The invention further provides a polynucleotide comprising a nucleotide sequence that 
5 encodes such polypeptides, vectors which comprise such polynucleotides, and host 
cells which comprises such vectors, polynucleotides, and/or polypeptides. 

In yet another aspect, the invention provides a method for identifying 
inhibitors of an enzyme that cleaves the beta secretase cleavable site of APP 
comprising: 

10 a) culturing cells in a culture medium under conditions in which the 

enzyme causes processing of APP and release of amyloid beta-peptide into the 
medium and causes the accumulation of CTF99 fragments of APP in cell lysates, 

b) exposing the cultured cells to a test compound; and specifically 
determining whether the test compound inhibits the function of the enzyme by 

15 measuring the amount of amyloid beta-peptide released into the medium and/or the 
amount of CTF99 fragments of APP in cell lysates; 

c) identifying test compounds diminishing the amount of soluble amyloid 
beta peptide present in the culture medium and diminution of CTF99 fragments of 
APP in cell lysates as Asp2 inhibitors. In preferred embodiments, the cultured cells 

20 are a human, rodent or insect cell line. It is also preferred that the human or rodent 
cell line exhibits p secretase activity in which processing of APP occurs with release 
of amyloid beta-peptide into the culture medium and accumulation of CTF99 in cell 
lysates. Among the contemplated test compounds are antisense oligomers directed 
against the enzyme that exhibits P secretase activity, which oligomers reduce release 

25 of soluble amyloid beta-peptide into the culture medium and accumulation of CTF99 
in cell lysates. 

In yet another aspect, the invention provides a method for the identification of 
an agent that decreases the activity of a Hu-Asp polypeptide selected from the group 
consisting of Hu-Asp 1, Hu-Asp2(a), and Hu-Asp2(b), the method comprising: 
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a) determining the activity of said Hu-Asp polypeptide in the presence of 
a test agent and in the absence of a test agent; and 

b) comparing the activity of said Hu-Asp polypeptide determined in the 
presence of said test agent to the activity of said Hu-Asp polypeptide determined in 

5 the absence of said test agent; whereby a lower level of activity in the presence of said 
test agent than in the absence of said test agent indicates that said test agent has 
decreased the activity of said Hu-Asp polypeptide. 

In a related aspect, the invention provides a method for assaying for 
modulators of P-secretase activity, comprising the steps of: 

10 (a) contacting a first composition with a second composition both in the 

presence and in the absence of a putative modulator compound, wherein the first 
composition comprises a mammalian p-secretase polypeptide or biologically active 
fragment thereof, and wherein the second composition comprises a substrate 
polypeptide having an amino acid sequence comprising a P-secretase cleavage site; 

15 (b) measuring cleavage of the substrate polypeptide in the presence and in the absence 
of the putative modulator compound; and (c) identifying modulators of P-secretase 
activity from a difference in cleavage in the presence versus in the absence of the 
putative modulator compound. A modulator that is a P-secretase antagonist 
(inhibitor) reduces such cleavage, whereas a modulator that is a P-secretase agonist 

20 increases such cleavage. Since such assays are relevant to development of 

Alzheimer's disease therapeutics for humans, it will be readily apparent that, in one 
preferred embodiment, the first composition comprises a purified human Asp2 
polypeptide. In one variation, the first composition comprises a soluble fragment of a 
human Asp2 polypeptide that retains Asp2 P-secretase activity. Several such 

25 fragments (including ATM fragments) are described herein in detail. Thus, in a 
particular embodiment, the soluble fragment is a fragment lacking an Asp2 
transmembrane domain. Assaying to identify inhibitors of Aspl P-secretase activity 
also is contemplated. 

The P-secretase cleavage site in APP is known, and it will be 

30 appreciated that the assays of the invention can be performed with either intact APP or 
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fragments or analogs of APP that retain the P-secretase recognition and cleavage site. 
Thus, in one variation, the substrate polypeptide of the second composition comprises 
the amino acid sequence SEVNLDAEFR, which includes the P-secretase recognition 
site of human APP that contains the "Swiss" mutation. In another variation, the 
5 substrate polypeptide of the second composition comprises the amino acid sequence 
EVKMDAEF. In another variation, the second composition comprises a polypeptide 
having an amino acid sequence of a human amyloid precursor protein (APP). For 
example, the human amyloid precursor protein is selected from the group consisting 
of: APP695, APP751, and APP770. Preferably, the human amyloid precursor protein 

10 (irrespective of isoform selected) includes at least on mutation selected from a 

KM-NL Swiss mutation and a V-*F London mutation. As explained elsewhere, one 
preferred embodiment involves a variation wherein the polypeptide having an amino 
acid sequence of a human APP further comprises an amino acid sequence comprising 
a marker sequence attached amino-terminal to the amino acid sequence of the human 

15 amyloid precursor protein. Preferably, the polypeptide having an amino acid 

sequence of a human APP further comprises two lysine residues attached to the 
carboxyl terminus of the amino acid sequence of the human APP. The assays can be 
performed in a cell free setting, using cell-free enzyme and cell-free substrate, or can 
be performed in a cell-based assay wherein the second composition comprises a 

20 eukaryotic cell that expresses amyloid precursor protein (APP) or a fragment thereof 
containing a P-secretase cleavage site. Preferably, the APP expressed by the host cell 
is an APP variant that includes two carboxyl-terminal lysine residues. It will also be 
appreciated that the P-secretase enzyme can be an enzyme that is expressed on the 
surface of the same cells. 

25 The present invention provides isolated nucleic acid molecules comprising a 

polynucleotide that codes for a polypeptide selected from the group consisting of 
human aspartyl proteases. In particular, human aspartyl protease 1 (Hu-Aspl) and 
two alternative splice variants of human aspartyl protease-2 (Hu-Asp2), a "long" (L) 
form designated herein as Hu-Asp2(a) and a "short" (S) form designated Hu-Asp2(b). 

30 As used herein, all references to "Hu-Asp" should be understood to refer to all of 
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Hu-Aspl, Hu-Asp2(a), and Hu-Asp2(b). In addition, as used herein, all references to 
"Hu-Asp2" should be understood to refer to both Hu-Asp2(a) and Hu-Asp2(b). 
Hu-Aspl is expressed most abundantly in pancreas and prostate tissues, while 
Hu-Asp2(a) and Hu-Asp2(b) are expressed most abundantly in pancreas and brain 
5 tissues. The invention also provides isolated Hu-Aspl, Hu-Asp2(a), and Hu-Asp2(b) 
polypeptides, as well as fragments thereof which exhibit aspartyl protease activity. 

In a preferred embodiment, the nucleic acid molecules comprise a polynucleotide 
having a nucleotide sequence selected from the group consisting of residues 1-1554 of 
SEQ ID NO. 1, encoding Hu-Aspl, residues 1-1503 of SEQ ID NO. 3, encoding 

10 Hu-Asp2(a), and residues 1-1428 of SEQ ID NO.5, encoding Hu-Asp2(b). In another 
aspect, the invention provides an isolated nucleic acid molecule comprising a 
polynucleotide which hybridizes under stringent conditions to a polynucleotide encoding 
Hu-Aspl , Hu-Asp2(a), Hu-Asp-2(b), or fragments thereof. European patent application 
EP 0 848 062 discloses a polypeptide referred to as "Asp 1," that bears substantial 

15 homology to Hu-Aspl, while international application WO 98/22597 discloses a 
polypeptide referred to as "Asp 2," that bears substantial homology to Hu-Asp2(a). 

The present invention also provides vectors comprising the isolated nucleic acid 
molecules of the invention, host cells into which such vectors have been introduced, and 
recombinant methods of obtaining a Hu-Aspl , Hu-Asp2(a), or Hu-Asp2(b) polypeptide 

20 comprising culturing the above-described host cell and isolating the relevant polypeptide. 

In another aspect, the invention provides isolated Hu-Aspl, Hu-Asp2(a), and 
Hu-Asp2(b) polypeptides, as well as fragments thereof. In a preferred embodiment, the 
Hu-Aspl , Hu-Asp2(a), and Hu-Asp2(b) polypeptides have the amino acid sequence given 
in SEQ ID NO. 2, SEQ ID NO. 4, or SEQ ID NO.6, respectively. The present invention 

25 also describes active forms of Hu-Asp2, methods for preparing such active forms, 
methods for preparing soluble forms, methods for measuring Hu-Asp2 activity, and 
substrates for Hu-Asp2 cleavage. The invention also describes antisense oligomers 
targeting the Hu-Aspl, Hu-Asp2(a) and Hu-Asp2(b) mRNA transcripts and the use of 
such antisense reagents to decrease such mRNA and consequently the production of the 

30 corresponding polypeptide. Isolated antibodies, both polyclonal and monoclonal, that 
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binds specifically to any of the Hu-Aspl, Hu-Asp2(a), and Hu-Asp2(b) polypeptides of 
the invention are also provided. 

The invention also provides a method for the identification of an agent that 
modulates the activity of any of Hu-Asp-1 , Hu-Asp2(a), andHu-Asp2(b). The inventions 
5 describes methods to test such agents in cell-free assays to which Hu-Asp2 polypeptide 
is added, as well as methods to test such agents in human or other mammalian cells in 
which Hu-Asp2 is present. 

The invention provides for methods for assaying for human Aspl(hu-Aspl) a- 
secretase activity comprising contacting the hu-Aspl protein with an amyloid precursor 

1 0 protein (APP) substrate, wherein the substrate contains an a-secretase cleavage site; and 
measuring cleavage of the APP substrate at the a-secretase cleavage site, thereby 
assaying hu-Aspl a-secretase activity. An example of a-secretase activity is APP 
processing wherein the APP substrate is cleaved at a site adjacent to the cell membrane 
(at residues Phe 20 iAla 21 in relation to the Ap peptide). This cleavage results in the release 

15 of a soluble, extracellular domain of APP, known as amyloid alpha peptide (sAPPa), 
from the cell surface into the cytoplasm. The sAPPa within the cytoplasm can be 
detected and quantitated thereby measuring a-secretase activity. 

The hu-Aspl enzyme used in the methods of the invention can be purified and 
isolated from a cell which is transfected or transformed with a polynucleotide that 

20 encodes hu- Asp 1 , such as SEQ ID NO: 1 , or a polynucleotide sequence that encodes the 
the amino acid sequence of SEQ ID NO: 2. Further, the hu-Aspl protein used in the 
methods may be a fragment of the amino acid sequence of SEQ ID NO: 2 which retains 
a-secretase activity. Possible fragments that may be of use for the methods include those 
lacking the transmembrane domain amino acids 469-492 of SEQ ID NO: 2, those 

25 fragments that lack the cytoplasmic amino acids 493-492 of SEQ ID NO: 2, those 
fragments that lack the amino terminal amino acids 1-62 of SEQ TD NO: 2 or 
combinations thereof. 

The invention also encompasses methods of assaying for a-secretase activity 
where hu-Aspl protein and its substrate are brought into contact by a growing cell 

30 transfected or transformed with a polynucleotide encoding the hu-Aspl protein or a 
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fragment thereof that retains cc-secretase activity under conditions where the cell 
expresses hu-Aspl protein in the presence of the APP substrate. The APP substrate in 
such circumstances can be exogenously introduced, or more preferably, is expressed by 
the cell that expresses Asp 1 . These methods also encompass contacting hu-Asp 1 protein 
5 with a cell that expresses a polynucleotide that encodes an APP substrate containing an 
a-secretase cleavage site. For example, the cell may express a polynucleotide that 
encodes a polypeptide having an cc-secretase cleavage site comprising the amino acid 
sequence LVFFAEDF or KLVFFAED. In addition, the APP substrate may comprise any 
human isoform of APP, such as "normal" APP (APP695), APP 75 1 , or APP770. These 

10 APP substrates can be further modified to comprise a carboxy-terminal di-lysine motif. 

To measure the cleavage of the substrates for the methods of assaying for a- 
secretase activity of the invention, the substrates of the method can be further modified 
to comprise detectable labels such as radioactive, enzymatic, chemilumenescent or 
flourescent labels. In particular, shorter peptide substrates preferably comprise internally 

15 quenched labels that result in increased detectability after cleavage of the peptide 
substrates. The peptide substrates may be modified to have attached a paired flurophore 
and quencher including but not limited to 7-amino-4-methyl coumarin and dinitrophenol, 
respectively, such that cleavage of the peptide by the hu-Aspl results in increased 
fluorescence due to physical separation of the flurophore and quencher. Other paired 

20 flurophores and quenchers include bodipy-tetramethylrhodamine and QS Y-5 (Molecular 
Probes, Inc.) In a variant of this assay, biotin or another suitable tag may be placed on 
one end of the peptide to anchor the peptide to a substrate assay plate and a flurophore 
may be placed at the other end of the peptide. Useful flurophores include those listed 
above as well as Europium labels such as W8044 (EG&G Wallac, Inc.). A preferred 

25 label is Oregon green that may be attached to a Cys residue. Cleavage of the peptide by 
Aspl will release the flurophore or other tag from the plate, allowing compounds to be 
assayed for inhibition of Aspl proteolytic cleavage as shown by an increase in retained 
fluorescence. Preferred colorimetric assays of hu-Aspl proteolytic activity utilize other 
suitable substrates that include the P 2 and P, amino acids comprising the recognition site 

30 for cleavage linked to o-nitrophenol through an amide linkage, such that cleavage by the 
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hu-Aspl results in an increase in optical density after altering the assay buffer to alkaline 
pH. 

The prevent invention also provides for methods of assaying for a-secretase 
activity comprising contacting hu-Aspl protein with an APP substrate, determining the 
5 level of hu-Asp 1 a-secretase activity in the presence and absence of a modulator of hu- 
Aspl a-secretase activity and comparing the hu-Aspl secretase activity in the presence 
and absence of the modulator. The modulators determined to increase hu-Aspl a- 
secretase activity will be identified as candidate Alzheimer's disease therapeutics. The 
invention also encompasses methods which comprise a step for treating Alzheimer's 

10 disease with identified candidate Alzheimer disease therapeutics. The invention also 
provides for compositions comprising a candidate Alzheimer's disease therpeutic 
identified by the a-secretase assaying methods of the invention. Aspl modulators that 
reduce Aspl P-secretase activity and increase Aspl a-secretase activity are highly 
preferred. Assays for Aspl p-secretase activity are preferred essentially as described in 

15 detail herein for Asp2. 

The invention provides for Asp 1 protease substrate peptides or fragments thereof, 
wherein said peptides comprise an amino acid sequence consisting of fifty or fewer 
amino acids which comprise the Aspl cleavage site having the amino acid sequence 
GLALALEP. This peptide was derived from the Aspl amino acid sequence and the 

20 discovery of an apparent Aspl autocatalytic cleavage in acidic conditions. The Aspl 
substrate of the invention may also comprise a detectable label, such as a radioactive 
label, chemiluminescent label, enzymatic label or a flourescent label. The flourescently 
labeled substrate can consist of internally quenched labels as described above. 

The invention also encompasses methods comprising the steps of contacting hu- 

25 Aspl protein with an Aspl substrate under acidic conditions and determining the level 
of Aspl proteolytic activity. An example of Aspl proteolytic activity is the auto-catalytic 
processing hu-Asp undergoes in acidic environments, wherein cleavage occurs at an 
amino acid site surrounding Ala 63 and cleaves the amino terminal amino acids of the hu- 
Aspl pro-peptide. The hu-Aspl pro-peptide refers to a secreted form of Aspl that has 

30 completed intercellular processing which resulted in cleavage of its signal sequence. 
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For the methods of assaying Aspl proteolytic activity, the hu-Aspl may be 
produced in a cell transformed or transfected with a polynucleotide that encodes hu-Aspl . 
The hu-Aspl protein may be isolated and purified from these cells or the method may 
utilize a cell growing under conditions that it expresses hu-Aspl . The method may also 
5 be carried out with a fragment of hu-Aspl that retains its proteolytic activity. The 
fragments provided for by the invention include hu-Aspl polypeptide sequences which 
lack the amino acids that encode a transmembrane domain such as amino acids 469-492 
of SEQ ID NO: 2 or fragments that lacks the cytoplasmic domain such as amino acids 
493-518ofSEQIDNO:2. 

10 The invention provides for a purified polynucleotide comprising a nucleotide 

sequence encoding a polypeptide that comprises a fragment of a hu-Aspl protein, 
wherein said nucleotide sequence lacks the sequence that encodes amino acids 23-62 of 
SEQ ID NO: 2, and wherein the polypeptide has hu-Aspl a-secretase activity. This 
portion of the Aspl sequence corresponds to a punitive pro-peptide that is removed, 

1 5 apparently through autocatalysis, under acidic conditions. These polypeptide fragments 
also include those lacking the amino terminal amino acids 1-62 of SEQ ID NO: 2. 

The invention encompasses a purified polynucleotide comprising a nucleotide 
sequence that hybridizes under stringent conditions to the non-coding strand 
complementary to SEQ ID NO: 1, wherein the nucleotide sequence encodes a 

20 polypeptide having Aspl proteolytic activity and wherein the polynucleotide lacks 
nucleotides encoding a transmembrane domain. Further, the invention also provides for 
a purified polynucleotide sequence comprising a nucleotide sequence that hybridizes 
under stringent conditions to the non-coding strand complementary to SEQ ID NO: 1 , , 
wherein the nucleotide sequence encodes a polypeptide further lacking a pro-peptide 

25 domain corresponding to amino acids 23-62 of SEQ ID NO: 2. 

The invention also provides for vectors comprising the hu-Aspl polynucleotide 
of the invention and host cells transfected or transformed with these vectors. The 
invention also encompasses host cells transfected or transformed with the hu-Aspl 
polynucleotides of the invention. 



-23- 



WO 01/23533 



PCT/US00/26080 



Another embodiment of the invention is a purified polypeptide comprising a 
fragment of a hu-Aspl protein, wherein said polypeptide lacks the hu-Aspl 
transmembrane domain and retains hu-Aspl a-secretase activity. These polypeptides 
include fragments of hu-Aspl having the amino acid sequence set forth in SEQ ID NO: 
5 2, and wherein the polypeptide optionally lacks the transmembrane domain amino acids 
469-492 of SEQ ID NO: 2, wherein the polypeptide lacks the cytoplasmic domain amino 
acids 493-518 of SEQ ED NO: 2 as well. In one variation, the invention provides a 
polypeptide that lacks amino terminal amino acids 1-62 of SEQ ID NO: 2 but retains 
Aspl proteolytic activity. Fragments lacking both the aforementioned amino-terminal 

10 and carboxy terminal residues are contemplated. 

The invention provides for a polypeptide comprising a fragment of hu-Aspl 
having the amino acid sequence set forth in SEQ DD NO: 2 and wherein said polypeptide 
lacks the amino terminal amino acids 1-62 and retains APP processing activity. For 
example, referring to the Aspl sequence in SEQ ID NO: 2, this pre-pro portion would 

15 correspond to residues 22 to 62. By performing conventional sequence analysis, the 
corresponding portions of the Aspl sequence can also be identified. 

Another embodiment of the invention i s a polypeptide comprising the amino acid 
sequence at least 95% identical to a fragment of hu-Aspl protein, wherein said 
polypeptide and said fragment lack a transmembrane domain and retain hu-Aspl a- 

20 secretase activity. In addition, the invention embodies a polypeptide comprising an 
amino acid sequence at least 95% identical to a fragment of hu-Aspl protein, wherein 
said polypeptide and said fragment lack the amino terminal amino acids corresponding 
to the pre-pro portion of hu-Aspl and retain APP processing activity. 

Additional features and variations of the invention will be apparent to those 

25 skilled in the art from the entirety of this application, including the drawing and detailed 
description, and all such features are intended as aspects of the invention. Likewise, 
features of the invention described herein can be re-combined into additional 
embodiments that are also intended as aspects of the invention, irrespective of whether 
the combination of features is specifically mentioned above as an aspect or embodiment 

30 of the invention. Also, only such limitations which are described herein as critical to the 
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invention should be viewed as such; variations of the invention lacking limitations which 
have not been described herein as critical are intended as aspects of the invention. 

In addition to the foregoing, the invention includes, as an additional 
aspect, all embodiments of the invention narrower in scope in any way than the variations 
5 specifically mentioned above. Although the applicant(s) invented the full scope of the 
claims appended hereto, the claims appended hereto are not intended to encompass 
within their scope the prior art work of others. Therefore, in the event that statutory prior 
art within the scope of a claim is brought to the attention of the applicants by a Patent 
Office or other entity or individual, the applicant(s) reserve the right to exercise 
10 amendment rights under applicable patent laws to redefine the subject matter of such a 
claim to specifically exclude such statutory prior art or obvious variations of statutory 
prior art from the scope of such a claim. Variations of the invention defined by such 
amended claims also are intended as aspects of the invention. 



1 5 BRIEF DESCRIPTION OF THE SEQUENCE LISTING 

Sequence ED No. 1 : Human Asp-1 , nucleotide sequence. 

Sequence ED No. 2: Human Asp-1, predicted amino acid sequence. 

Sequence ED No. 3: Human Asp-2(a), nucleotide sequence. 

Sequence ED No. 4: Human Asp-2(a), predicted amino acid sequence. The 
20 Asp2(a) amino acid sequence includes a putative signal peptide comprising residues 1 
to 21 ; and a putative pre-propeptide after the signal peptide that extends through 
residue 45 (as assessed by processing observed of recombinant Asp2(a) in CHO 
cells), and a putative propeptide that may extend to at least about residue 57, based on 
the observation of an observed GRR1GS sequence which has characteristics of a 
25 protease recognition sequence. The Asp2(a) further includes a transmembrane 

domain comprising residues 455-477, a cytoplasmic domain comprising residues 478- 
501, and a putative alpha-helical spacer region, comprising residues 420-454, believed 
to be unnecessary for proteolytic activity, between the protease catalytic domain and 
the transmembrane domain. 
30 Sequence ID No. 5: Human Asp-2(b), nucleotide sequence. 
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Sequence ID No. 6: Human Asp-2(b), predicted amino acid sequence. The 
Asp2(b) amino acid sequence includes a putative signal peptide, pre-propeptide, and 
propeptide as described above for Asp2(a). The Asp2(b) further includes a 
transmembrane domain comprising residues 430-452, a cytoplasmic domain 
5 comprising residues 453-476, and a putative alpha-helical spacer region, comprising 
residues 395-429, believed to be unnecessary for proteolytic activity, between the 
protease catalytic domain and the transmembrane domain. 

Sequence ID No. 7: Murine Asp-2(a), nucleotide sequence. 

Sequence ID No. 8: Murine Asp-2(a), predicted amino acid sequence. The 
10 proteolytic processing of murine Asp2(a) is believed to be analogous to the processing 
described above for human Asp2(a). In addition, a variant lacking amino acid 
residues 190-214 of SEQ ID NO: 8 is specifically contemplated as a murine Asp2(b) 
polypeptide. 

Sequence ID No. 9: Human APP695, nucleotide sequence. 
15 Sequence ID No. 10: Human APP695, predicted amino acid sequence. 

Sequence ID No. 1 1 : Human APP695-Sw, nucleotide sequence. 
Sequence ID No. 12: Human APP695-Sw. predicted amino acid sequence. In 
the APP695 isoform, the Sw mutation is characterized by a KM-NL alteration at 
positions 595-596 (compared to normal APP695). 
20 Sequence ID No. 13: Human APP695-VF, nucleotide sequence. 

Sequence ID No.l 4: Human APP695-VF, predicted amino acid sequence. In 
the APP 695 isoform, the VF mutation is characterized by a V-F alteration at position 
642 (compared to normal APP 695). 

Sequence ID No.l 5: Human APP695-KK, nucleotide sequence. 
25 Sequence ID No. 16: Human APP695-KK, predicted amino acid sequence. 

(APP695 with two carboxy-terminal lysine residues.) 

Sequence ID No.l 7: Human APP695-Sw-KK, nucleotide sequence. 
Sequence ID No. 18: Human APP695-Sw-KK, predicted amino acid sequence 
Sequence ID No. 19: Human APP695-VF-KK, nucleotide sequence 
30 Sequence ID No.20: Human APP695-VF-KK, predicted amino acid sequence 
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Sequence ID No.2 1 : T7-Human-pro-Asp-2(a)ATM, nucleotide sequence 
Sequence ID No.22: T7-Human-pro-Asp-2(a)ATM, amino acid sequence 
Sequence ID No.23: T7-Caspase-Human-pro-Asp-2(a)ATM, nucleotide 
sequence 

5 Sequence ID No.24: T7-Caspase-Human-pro-Asp-2(a)ATM, amino acid 

sequence 

Sequence ID No.25 : Human-pro- Asp-2(a)ATM (low GC), nucleotide 
sequence 

Sequence ID No.26: Human-pro-Asp-2(a)ATM, (low GC), amino acid 
10 sequence 

Sequence ID No.27: T7-Caspase-Caspase 8 
cleavage-Human-pro-Asp-2(a)ATM, nucleotide sequence 

Sequence ID No.28: T7-Caspase-Caspase 8 
cleavage-Human-pro-Asp-2(a)ATM, amino acid sequence 
15 Sequence ID No.29: Human Asp-2(a)ATM, nucleotide sequence 

Sequence ID No.30: Human Asp-2(a)ATM, amino acid sequence 
Sequence ID No.3 1 : Human Asp-2(a)ATM(His) 6 , nucleotide sequence 
Sequence ID No. 32: Human Asp-2(a)ATM(His) 6 , amino acid sequence 
Sequence ID Nos. 33-49 are short synthetic peptide and oligonucleotide 
20 sequences that are described below in the Detailed Description of the Invention. 

Sequence ID No. 50: Human Asp2(b)ATM polynucleotide sequence. 
Sequence ID No. 5 1 : Human Asp2(b)ATM polypeptide sequence (exemplary 
variant of Human Asp2(b) lacking transmembrane and intracellular domains of Hu- 
Asp2(b) set forth in SEQ ID NO: 6. 
25 Sequence ID No. 52: Human Asp2(b)ATM(His) 6 polynucleotide sequence. 

Sequence ID No. 53: Human Asp2(b)ATM(His) 6 polypeptide sequence 
(Human Asp2(b)ATM with six histidine tag attached to C-terminus) 

Sequence ID No. 54: Human APP770-encoding polynucleotide sequence. 
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Sequence ED No. 55: Human APP770 polypeptide sequence. To introduce the 
KM-NL Swedish mutation, residues KM at positions 670-71 are changed to NL. To 
introduce the V-F London mutation, the V residue at position 717 is changed to F. 

Sequence ED No. 56: Human APP751 encoding polynucleotide sequence. 
5 Sequence ID No. 57: Human APP751 polypeptide sequence (Human APP751 

isoform). 

Sequence ID No. 58: Human APP770-KK encoding polynucleotide sequence. 

Sequence ID No. 59: Human APP770-KK polypeptide sequence. (Human 
APP770 isoform to which two C-terminal lysines have been added). 
1 0 Sequence ED No. 60: Human APP75 1 -KK encoding polynucleotide sequence. 

Sequence ID No. 61: Human APP751-KK polypeptide sequence (Human 
APP751 isoform to which two C-terminal lysines have been added). 

Sequence ID Nos. 62-65: Various short peptide sequences described in detail 
in detailed description. 
15 Sequence ED No. 66: Predicted amino acid sequence of human Asp- 

1 ATM(His) 6 as described in Example 14. 

Sequence ED No. 67: Amino acid sequence of secreted recombinant Asp- 
lATM(His)6 as described in Example 14. 

Sequence ED No. 68: Amino acid sequence of acid-processed form of 
20 AsplA(His) 6 . 

Sequence ID No. 69: Amino acid sequence of the self-activated acid 
processing site within Asp- 1 ATM. 

Sequence ED No. 70: Amino acid sequence of a peptide that includes the p- 
secretase processing site within the Swedish mutant form of APP. 
25 Sequence ID No. 71 : Amino acid sequence of a peptide (residues 17-24) that 

includes the a-secretase processing site within the AP peptide (Ap i2 . 28 ). 

Sequence ID No. 72: Amino acid sequence of a peptide (residues 16-23) that 
includes the a-secretase processing site within the AP peptide (AP I2 _ 28 ). 

Sequence ID No. 73-74: PCR primers described in Example 14. 
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Sequence ID No. 75: Amino acid sequence of a y-secretase substrate polypeptide 
described in Example 15. 

BRIEF DESCRIPTION OF THE FIGURES 

5 Figure 1 : Figure 1 shows the nucleotide (SEQ ID NO: 1 ) and predicted amino 

acid sequence (SEQ ED NO:2) of human Aspl. 

Figure 2: Figure 2 shows the nucleotide (SEQ ID NO:3) and predicted amino 
acid sequence (SEQ ID NO:4) of human Asp2(a). 

Figure 3 : Figure 3 shows the nucleotide (SEQ ID NO:5) and predicted amino 
1 0 acid sequence (SEQ ID NO:6) of human Asp2(b). The predicted transmembrane domain 
of Hu-Asp2(b) is enclosed in brackets. 

Figure 4: Figure 4 shows the nucleotide (SEQ ED No. 7) and predicted amino 
acid sequence (SEQ ED No. 8) of murine Asp2(a) 

Figure 5 : Figure 5 shows the BestFit alignment of the predicted amino acid 
1 5 sequences of Hu-Asp2(a) and murine Asp2(a) 

Figure 6: Figure 6 shows the nucleotide (SEQ ED No. 21) and predicted 
amino acid sequence (SEQ ID No. 22) of T7-Human-pro-Asp-2(a)ATM 

Figure 7: Figure 7 shows the nucleotide (SEQ ID No. 23) and predicted 
amino acid sequence (SEQ ID No. 24) of T7-caspase-Human-pro-Asp-2(a)ATM 
20 Figure 8: Figure 8 shows the nucleotide (SEQ ID No. 25) and predicted 

amino acid sequence (SEQ ED No. 26) of Human-pro- Asp-2(a)ATM (low GC) 

Figure 9: Western blot showing reduction of CTF99 production by 
HEK125.3 cells transfected with antisense oligomers targeting the Hu-Asp2 mRNA. 

Figure 10: Western blot showing increase in CTF99 production in mouse 
25 Neuro-2a cells cotransfected with APP-KJC with and without Hu-Asp2 only in those cells 
cotransfected with Hu-Asp2. A further increase in CTF99 production is seen in cells 
cotransfected with APP-Sw-KK with and without Hu-Asp2 only in those cells 
cotransfected with Hu-Asp2 

Figure 1 1 : Figure 1 1 shows the predicted amino acid sequence (SEQ ID No. 
30 30) of Human- Asp2(a)ATM 
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Figure 12: Figure 1 1 shows the predicted amino acid sequence (SEQ ED No. 
30) of Human- Asp2(a)ATM(His) 6 

DETAILED DESCRIPTION OF THE INVENTION 

5 A few definitions used in this invention follow, most definitions to be used are 

those that would be used by one ordinarily skilled in the art. 

The term n p amyloid peptide" means any peptide resulting from beta secretase 
cleavage of APP. This includes peptides of 39, 40, 41, 42 and 43 amino acids, extending 
from the p-secretase cleavage site to 39, 40, 41, 42 and 43 amino acids C-terminal to the 

1 0 P-secretase cleavage site, p amyloid peptide also includes sequences 1 -6, SEQ ID NOs. 
1-6 of US 5,750,349, issued 12May 1998 (incorporated into this document by reference). 
A p-secretase cleavage fragment disclosed here is called CTF-99, which extends from 
P-secretase cleavage site to the carboxy terminus of APP. 

When an isoform of APP is discussed then what is meant is any APP 

1 5 polypeptide, including APP variants (including mutations), and APP fragments that 
exists in humans such as those described in US 5,766,846, col 7, lines 45-67, 
incorporated into this document by reference. 

The term "P-amyloid precursor protein" (APP) as used herein is defined as a 
polypeptide that is encoded by a gene of the same name localized in humans on the 

20 long arm of chromosome 21 and that includes "PAP - here "p-amyloid protein" see 
above, within its carboxyl third. APP is a glycosylated, single-membrane spanning 
protein expressed in a wide variety of cells in many mammalian tissues. Examples of 
specific isotypes of APP which are currently known to exist in humans are the 695 
amino acid polypeptide described by Kang et. al. (1987) Nature 325:733-736 which is 

25 designated as the "normal" APP (SEQ TD NOs: 9-10); the 751 amino acid polypeptide 
described by Ponte et al. (1988) Nature 331:525-527 (1988) and Tanzi et al. (1988) 
Nature 331:528-530 (SEQ ID NOs: 56-57); and the 770-amino acid polypeptide 
described by Kitaguchi et. al. (1988) Nature 331:530-532 (SEQ ID NOs: 54-55). 
Examples of specific variants of APP include point mutation which can differ in both 

30 position and phenotype (for review of known variant mutation see Hardy (1992) 
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Nature Genet. 1:233-234). All references cited here incorporated by reference. The 
term "APP fragments" as used herein refers to fragments of APP other than those 
which consist solely of PAP or pAP fragments. That is, APP fragments will include 
amino acid sequences of APP in addition to those which form intact PAP or a 
5 fragment of PAP. 

When the term "any amino acid" is used, the amino acids referred to are to be 
selected from the following, three letter and single letter abbreviations - which may 
also be used, are provided as follows: 

Alanine, Ala, A; Arginine, Arg, R; Asparagine, Asn, N; Aspartic acid, Asp, 

10 D; Cysteine, Cys, C; Glutamine, Gin, Q; Glutamic Acid, Glu, E; Glycine, Gly, G; 
Histidine, His, H; Isoleucine, He, I; Leucine, Leu, L; Lysine, Lys, K; Methionine, 
Met, M; Phenylalanine, Phe, F; Proline, Pro, P; Serine, Ser, S; Threonine, Thr, T; 
Tryptophan, Trp, W; Tyrosine, Tyr, Y; Valine, Val, V; Aspartic acid or Asparagine, 
Asx, B; Glutamic acid or Glutamine, Glx, Z; Any amino acid, Xaa, X. 

15 The present invention describes a method to scan gene databases for the 

simple active site motif characteristic of aspartyl proteases. Eukaryotic aspartyl 
proteases such as pepsin and renin possess a two-domain structure which folds to 
bring two aspartyl residues into proximity within the active site. These are embedded 
in the short tripeptide motif DTG, or more rarely, DSG. Most aspartyl proteases 

20 occur as proenzyme whose N-terminus must be cleaved for activation. The DTG or 
DSG active site motif appears at about residue 65-70 in the proenzyme (prorenin, 
pepsinogen), but at about residue 25-30 in the active enzyme after cleavage of the 
N-terminal prodomain. The limited length of the active site motif makes it difficult to 
search collections of short, expressed sequence tags (EST) for novel aspartyl 

25 proteases. EST sequences typically average 250 nucleotides or less, and so would 

encode 80-90 amino acid residues or less. That would be too short a sequence to span 
the two active site motifs. The preferred method is to scan databases of hypothetical 
or assembled protein coding sequences. The present invention describes a computer 
method to identify candidate aspartyl proteases in protein sequence databases. The 

30 method was used to identify seven candidate aspartyl protease sequences in the 
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Caenorhabditis elegans genome. These sequences were then used to identify by 
homology search Hu-Aspl and two alternative splice variants of Hu-Asp2, designated 
herein as Hu-Asp2(a) and Hu-Asp2(b). 

In a major aspect of the invention disclosed here we provide new information 
5 about APP processing. Pathogeneic processing of the amyloid precursor protein 

(APP) via the AP pathway requires the sequential action of two proteases referred to 
as P-secretase and y-secretase. Cleavage of APP by the P-secretase and y-secretase 
generates the N-terminus and C-terminus of the AP peptide, respectively. Because 
over production of the AP peptide, particularly the AP,^ 2 , has been implicated in the 

10 initiation of Alzheimer's disease, inhibitors of either the P-secretase and/or the 
y-secretase have potential in the treatment of Alzheimer's disease. Despite the 
importance of the p-secretase and y-secretase in the pathogenic processing of APP, 
molecular definition of these enzymes has not been accomplished to date. That is, it 
was not known what enzymes were required for cleavage at either the P-secretase or 

15 the y-secretase cleavage site. The sites themselves were known because APP was 
known and the Ap,^ 2 , peptide was known, see US 5,766,846 and US 5,837,672, 
(incorporated by reference, with the exception to reference to "soluble" peptides). But 
what enzyme was involved in producing the Ap M2 , peptide was unknown. 

Alignment of the amino acid sequences of Hu-Asp2 with other known aspartyl 

20 proteases reveals a similar domain organization. All of the sequences contain a signal 
sequence followed by a pro-segment and the catalytic domain containing 2 copies of 
the aspartyl protease active site motif (DTG/DSG) separated by approximately 180 
amino acid residues. Comparison of the processing site for proteolytic removal of the 
pro-segment in the mature forms of pepsin A, pepsin C, cathepsin D, cathepsin E and 

25 renin reveals that the mature forms of these enzymes contain between 31-35 amino 
acid residues upstream of the first DTG motif. Inspection of this region in the 
Hu-Asp-2 amino acid sequence indicates a preferred processing site within the 
sequence GRRlGS as proteolytic processing of pro-protein precursors commonly 
occurs at site following dibasic amino acid pairs (eg. RR). Also, processing at this 

30 site would yield a mature enzyme with 35 amino acid residues upstream of the first 

-32- 



WO 01/23533 



PCT/US00/26080 



DTG, consistent with the processing sites for other aspartyl proteases. In the absence 
of self-activation of Hu-Asp2 or a knowledge of the endogenous protease that 
processes Hu-Asp2 at this site, a recombinant form was engineered by introducing a 
recognition site for the PreSission protease (LEVLFQ 1 GP) into the expression 
5 plasmids for bacterial, insect cell, and mammalian cell expression of pro-Hu-Asp2. In 
each case, the Gly residue in PT position corresponds to the Gly residue 35 amino 
acids upstream of the first DTG motif in Hu-Asp2. 

The present invention involves the molecular definition of several novel 
human aspartyl proteases and one of these, referred to as Hu-Asp-2(a) and 

10 Hu-Asp2(b), has been characterized in detail. Previous forms of aspl and asp 2 have 
been disclosed, see EP 0848062 A2 and EP 085 5444 A2, inventors David Powel et aL, 
assigned to Smith Kline Beecham Corp. (incorporated by reference). Herein are 
disclosed old and new forms of Hu-Asp 2. For the first time they are expressed in 
active form, their substrates are disclosed, and their specificity is disclosed. Prior to 

1 5 this disclosure cell or cell extracts were required to cleave the P-secretase site, now 

purified protein can be used in assays, also described here. Based on the results of (1) 
antisense knock out experiments, (2) transient transfection knock in experiments, and 
(3) biochemical experiments using purified recombinant Hu-Asp-2, we demonstrate 
that Hu-Asp-2 is the P-secretase involved in the processing of APP. Although the 

20 nucleotide and predicted amino acid sequence of Hu-Asp-2(a) has been reported, see 
above, see EP 0848062 A2 and EP 0855444A2, no functional characterization of the 
enzyme was disclosed. Here the authors characterize the Hu-Asp-2 enzyme and are 
able to explain why it is a critical and essential enzyme required in the formation of 
AP,^ 2 , peptide and possible a critical step in the development of AD. 

25 In another embodiment the present invention also describes a novel splice 

variant of Hu-Asp2, referred to as Hu-Asp-2(b), that has never before been disclosed. 

In another embodiment, the invention provides isolated nucleic acid molecules 
comprising a polynucleotide encoding a polypeptide selected from the group 
consisting of human aspartyl protease 1 (Hu-Aspl) and two alternative splice variants 

30 of human aspartyl protease-2 (Hu-Asp2), designated herein as Hu-Asp2(a) and 
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Hu-Asp2(b). As used herein, all references to "Hu-Asp2" should be understood to 
refer to both Hu-Asp2(a) and Hu-Asp2(b). Hu-Aspl is expressed most abundantly in 
pancreas and prostate tissues, while Hu-Asp2(a) and Hu-Asp2(b) are expressed most 
abundantly in pancreas and brain tissues. The invention also provides isolated 
5 Hu-Aspl , Hu-Asp2(a), and Hu-Asp2(b) polypeptides, as well as fragments thereof 
which exhibit aspartyl protease activity. 

The predicted amino acid sequences of Hu-Aspl, Hu-Asp2(a) and Hu-Asp2(b) 
share significant homology with previously identified mammalian aspartyl proteases 
such as pepsinogen A, pepsinogen B, cathepsin D, cathepsin E, and renin. P.B.Szecs, 
10 Scand. J. Clin. Lab. Invest. J2:(Suppl. 210 5-22 (1992)). These enzymes are 
characterized by the presence of a duplicated DTG/DSG sequence motif. The 
Hu-Aspl and HuAsp2 polypeptides disclosed herein also exhibit extremely high 
homology with the ProSite consensus motif for aspartyl proteases extracted from the 
SwissProt database. 

1 5 The nucleotide sequence given as residues 1 -1 554 of SEQ ID NO: 1 

corresponds to the nucleotide sequence encoding Hu-Aspl, the nucleotide sequence 
given as residues 1-1 503 of SEQ ID NO:3 corresponds to the nucleotide sequence 
encoding Hu-Asp2(a), and the nucleotide sequence given as residues 1-1428 of SEQ 
ID NO:5 corresponds to the nucleotide sequence encoding Hu-Asp2(b). The isolation 

20 and sequencing of DNA encoding Hu-Aspl, Hu-Asp2(a), and Hu-Asp2(b) is 
described below in Examples 1 and 2. 

As is described in Examples 1 and 2, automated sequencing methods were 
used to obtain the nucleotide sequence of Hu-Aspl, Hu-Asp2(a), and Hu-Asp-2(b). 
The Hu-Asp nucleotide sequences of the present invention were obtained for both 

25 DNA strands, and are believed to be 100% accurate. However, as is known in the art, 
nucleotide sequence obtained by such automated methods may contain some errors. 
Nucleotide sequences determined by automation are typically at least about 90%, 
more typically at least about 95% to at least about 99.9% identical to the actual 
nucleotide sequence of a given nucleic acid molecule. The actual sequence may be 

30 more precisely determined using manual sequencing methods, which are well known 
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in the art. An error in sequence which results in an insertion or deletion of one or 
more nucleotides may result in a frame shift in translation such that the predicted 
amino acid sequence will differ from that which would be predicted from the actual 
nucleotide sequence of the nucleic acid molecule, starting at the point of the mutation. 
5 The Hu-Asp DNA of the present invention includes cDNA, chemically synthesized 
DNA, DNA isolated by PCR, genomic DNA, and combinations thereof. Genomic 
Hu-Asp DNA may be obtained by screening a genomic library with the Hu-Asp2 
cDNA described herein, using methods that are well known in the art, or with 
oligonucleotides chosen from the Hu-Asp2 sequence that will prime the polymerase 

10 chain reaction (PCR). RNA transcribed from Hu-Asp DNA is also encompassed by 
the present invention. 

Due to the degeneracy of the genetic code, two DNA sequences may differ and 
yet encode identical amino acid sequences. The present invention thus provides 
isolated nucleic acid molecules having a polynucleotide sequence encoding any of the 

1 5 Hu-Asp polypeptides of the invention, wherein said polynucleotide sequence encodes 
a Hu-Asp polypeptide having the complete amino acid sequence of SEQ ID NO:2, 
SEQ ID NO:4, SEQ ID NO:6, or fragments thereof. 

Also provided herein are purified Hu-Asp polypeptides, both recombinant and 
non-recombinant. Most importantly, methods to produce Hu-Asp2 polypeptides in 

20 active form are provided. These include production of Hu-Asp2 polypeptides and 

variants thereof in bacterial cells, insect cells, and mammalian cells, also in forms that 
allow secretion of the Hu-Asp2 polypeptide from bacterial, insect or mammalian cells 
into the culture medium, also methods to produce variants of Hu-Asp2 polypeptide 
incorporating amino acid tags that facilitate subsequent purification. In a preferred 

25 embodiment of the invention the Hu-Asp2 polypeptide is converted to a 

proteolytically active form either in transformed cells or after purification and 
cleavage by a second protease in a cell-free system, such active forms of the Hu-Asp2 
polypeptide beginning with the N-terminal sequence TQHGIR or ETDEEP. The 
sequence TQHGIR represents the amino-terminus of Asp2(a) or Asp2(b) beginning 

30 with residue 22 of SEQ ID NO: 4 or 6, after cleavage of a putative 2 1 residue signal 
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peptide. Recombinant Asp2(a) expressed in and purified from insect cells' was 
observed to have this amino terminus, presumably as a result of cleavage by a signal 
peptidase. The sequence ETDEEP represents the amino-terminus of Asp2(a) or 
Asp2(b) beginning with residue 46 of SEQ ID NO: 4 or 6, as observed when Asp2(a) 
5 has been recombinantly produced in CHO cells (presumably after cleavage by both a 
rodent signal peptidase and another rodent peptidase that removes a propeptide 
sequence). The Asp2(a) produced in the CHO cells possesses p-secretase activity, as 
described in greater detail in Examples 1 1 and 12. Variants and derivatives, including 
fragments, of Hu-Asp proteins having the native amino acid sequences given in SEQ 

10 ID Nos: 2, 4, and 6 that retain any of the biological activities of Hu-Asp are also 

-within the scope of the present invention. Of course, one of ordinary skill in the art 
will readily be able to determine whether a variant, derivative, or fragment of a 
Hu-Asp protein displays Hu-Asp activity by subjecting the variant, derivative, or 
fragment to a standard aspartyl protease assay. Fragments of Hu-Asp within the scope 

15 of this invention include those that contain the active site domain containing the 

amino acid sequence DTG, fragments that contain the active site domain amino acid 
sequence DSG, fragments containing both the DTG and DSG active site sequences, 
fragments in which the spacing of the DTG and DSG active site sequences has been 
lengthened, fragments in which the spacing has been shortened. Also within the 

20 scope of the invention are fragments of Hu-Asp in which the transmembrane domain 
has been removed to allow production of Hu-Asp2 in a soluble form. In another 
embodiment of the invention, the two halves of Hu-Asp2, each containing a single 
active site DTG or DSG sequence can be produced independently as recombinant 
polypeptides, then combined in solution where they reconstitute an active protease. 

25 Thus, the invention provides a purified polypeptide comprising a fragment of a 

mammalian Asp2 protein, wherein said fragment lacks the Asp2 transmembrane 
domain of said Asp2 protein, and wherein the polypeptide and the fragment retain the 
P-secretase activity of said mammalian Asp2 protein. In a preferred embodiment, the 
purified polypeptide comprises a fragment of a human Asp2 protein that retains the P- 
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secretase activity of the human Asp2 protein from which it was derived. Examples 
include: 

a purified polypeptide that comprises a fragment of Asp2(a) having the 
amino acid sequence set forth in SEQ ID NO: 4, wherein the polypeptide lacks 
5 transmembrane domain amino acids 455 to 477 of SEQ ID NO: 4; 

a purified polypeptide as described in the preceding paragraph that 
further lacks cytoplasmic domain amino acids 478 to 501 of SEQ ID NO: 4; 

a purified polypeptide as described in either of the preceding 
paragraphs that further lacks amino acids 420-454 of SEQ ID NO: 4, which 
10 constitute a putative alpha helical region between the catalytic domain and the 

transmembrane domain that is believed to be unnecessary for P-secretase 
activity; 

a purified polypeptide that comprises an amino acid sequence that 
includes amino acids 58 to 419 of SEQ ED NO: 4, and that lacks amino acids 
15 22 to 57 of SEQ ID NO: 4; 

a purified polypeptide that comprises an amino acid sequence that 
includes amino acids 46 to 419 of SEQ ID NO: 4, and that lacks amino acids 
22 to 45 of SEQ ID NO: 4; 

a purified polypeptide that comprises an amino acid sequence that 
20 includes amino acids 22 to 454 of SEQ ID NO: 4. 

a purified polypeptide that comprises a fragment of Asp2(b) having the 
amino acid sequence set forth in SEQ ID NO: 6, and wherein said polypeptide 
lacks transmembrane domain amino acids 430 to 452 of SEQ ID NO: 6; 

a purified polypeptide as described in the preceding paragraph that 
25 further lacks cytoplasmic domain amino acids 453 to 476 of SEQ ID NO: 6; 

a purified polypeptide as described in either of the preceding two 
paragraphs that further lacks amino acids 395-429 of SEQ ID NO: 4, which 
constitute a putative alpha helical region between the catalytic domain and the 
transmembrane domain that is believed to be unnecessary for p-secretase 
30 activity; 
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a purified polypeptide comprising an amino acid sequence that 
includes amino acids 58 to 394 of SEQ ID NO: 4, and that lacks amino acids 
22 to 57 of SEQ ID NO: 4; 

a purified polypeptide comprising an amino acid sequence that 
5 includes amino acids 46 to 394 of SEQ ID NO: 4, and that lacks amino acids 

22 to 45 of SEQ ID NO: 4; and 

a purified polypeptide comprising an amino acid sequence that 
includes amino acids 22 to 429 of SEQ ID NO: 4. 
Also included as part of the invention is a purified polynucleotide comprising a 
10 nucleotide sequence that encodes such polypeptides; a vector comprising a 

polynucleotide that encodes such polypeptides; and a host cell transformed or 
trans fected with such a polynucleotide or vector. 

Hu-Asp variants may be obtained by mutation of native Hu- Asp-encoding 
nucleotide sequences, for example. A Hu-Asp variant, as referred to herein, is a 
1 5 polypeptide substantially homologous to a native Hu-Asp polypeptide but which has 
an amino acid sequence different from that of native Hu-Asp because of one or more , 
deletions, insertions, or substitutions in the amino acid sequence. The variant amino 
acid or nucleotide sequence is preferably at least about 80% identical, more 
preferably at least about 90% identical, and most preferably at least about 95% 
20 identical, to a native Hu-Asp sequence. Thus, a variant nucleotide sequence which 
contains, for example, 5 point mutations for every one hundred nucleotides, as 
compared to a native Hu-Asp gene, will be 95% identical to the native protein. The 
percentage of sequence identity, also termed homology, between a native and a variant 
Hu-Asp sequence may also be determined, for example, by comparing the two 
25 sequences using any of the computer programs commonly employed for this purpose, 
such as the Gap program (Wisconsin Sequence Analysis Package, Version 8 for Unix, 
Genetics Computer Group, University Research Park, Madison Wisconsin), which 
uses the algorithm of Smith and Waterman (Adv. Appl. Math. 2: 482-489 (1981)). 

Alterations of the native amino acid sequence may be accomplished by any of 
30 a number of known techniques. For example, mutations may be introduced at 
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particular locations by procedures well known to the skilled artisan, such as 
oligonucleotide-directed mutagenesis, which is described by Walder et ah (Gene 
42:133 (1986)); Bauer et ah (Gene 37:13 (1985)); Craik (BioTechniques, January 
1985, pp. 12-19); Smith et ah (Genetic Engineering: Principles and Methods^ 
5 Plenum Press (1981)); and U.S. Patent Nos. 4,518,584 and 4,737,462. 

Hu-Asp variants within the scope of the invention may comprise 
conservatively substituted sequences, meaning that one or more amino acid residues 
of a Hu-Asp polypeptide are replaced by different residues that do not alter the 
secondary and/or tertiary structure of the Hu-Asp polypeptide. Such substitutions may 

10 include the replacement of an amino acid by a residue having similar physicochemical 
properties, such as substituting one aliphatic residue (He, Val, Leu or Ala) for another, 
or substitution between basic residues Lys and Arg, acidic residues Glu and Asp, 
amide residues Gin and Asn, hydroxyl residues Ser and Tyr, or aromatic residues Phe 
and Tyr. Further information regarding making phenotypically silent amino acid 

1 5 exchanges may be found in Bowie et ah, Science 24 7: 1 306- 1310(1 990). Other 

Hu-Asp variants which might retain substantially the biological activities of Hu-Asp 
are those where amino acid substitutions have been made in areas outside functional 
regions of the protein. 

In another aspect, the invention provides an isolated nucleic acid molecule 

20 comprising a polynucleotide which hybridizes under stringent conditions to a portion 
of the nucleic acid molecules described above, e.g., to at least about 15 nucleotides, 
preferably to at least about 20 nucleotides, more preferably to at least about 30 
nucleotides, and still more preferably to at least about from 30 to at least about 100 
nucleotides, of one of the previously described nucleic acid molecules. Such portions 

25 of nucleic acid molecules having the described lengths refer to, e.g., at least about 15 
contiguous nucleotides of the reference nucleic acid molecule. By stringent 
hybridization conditions is intended overnight incubation at about 42°C for about 2.5 
hours in 6 X SSC/0.1% SDS, followed by washing of the filters four times for 15 
minutes in 1.0 X SSC at 65°C, 0.1% SDS. 
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Fragments of the Hu-Asp encoding nucleic acid molecules described herein, as 
well as polynucleotides capable of hybridizing to such nucleic acid molecules may be 
used as a probe or as primers in a polymerase chain reaction (PCR). Such probes may 
be used, e.g., to detect the presence of Hu-Asp nucleic acids in in vitro assays, as well 
5 as in Southern and northern blots. Cell types expressing Hu-Asp may also be 

identified by the use of such probes. Such procedures are well known, and the skilled 
artisan will be able to choose a probe of a length suitable to the particular application. 
For PCR, 5' and 3' primers corresponding to the termini of a desired Hu-Asp nucleic 
acid molecule are employed to isolate and amplify that sequence using conventional 
10 techniques. 

Other useful fragments of the Hu-Asp nucleic acid molecules are antisense or 
sense oligonucleotides comprising a single stranded nucleic acid sequence capable of 
binding to a target Hu-Asp mRNA (using a sense strand), or Hu-Asp DNA (using an 
antisense strand) sequence. In a preferred embodiment of the invention these Hu-Asp 

15 antisense oligonucleotides reduce Hu-Asp mRNA and consequent production of 
Hu-Asp polypeptides. 

In another aspect, the invention includes Hu-Asp polypeptides with or without 
associated native pattern glycosylation. Both Hu-Asp I and Hu-Asp2 have canonical 
acceptor sites for Asn-linked sugars, with Hu-Asp 1 having two of such sites, and 

20 Hu-Asp2 having four. Hu-Asp expressed in yeast or mammalian expression systems 
(discussed below) may be similar to or significantly different from a native Hu-Asp 
polypeptide in molecular weight and glycosylation pattern. Expression of Hu-Asp in 
bacterial expression systems will provide non-glycosylated Hu-Asp. 

The polypeptides of the present invention are preferably provided in an 

25 isolated form, and preferably are substantially purified. Hu-Asp polypeptides may be 
recovered and purified from tissues, cultured cells, or recombinant cell cultures by 
well-known methods, including ammonium sulfate or ethanol precipitation, anion or 
cation exchange chromatography, phosphocellulose chromatography, hydrophobic 
interaction chromatography, affinity chromatography, hydroxylapatite 

30 chromatography, lectin chromatography, and high performance liquid chromatography 
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(HPLC). In a preferred embodiment, an amino acid tag is added to the Hu-Asp 
polypeptide using genetic engineering techniques that are well known to practitioners 
of the art which include addition of six histidine amino acid residues to allow 
purification by binding to nickel immobilized on a suitable support, epitopes for 
5 polyclonal or monoclonal antibodies including but not limited to the T7 epitope, the 
myc epitope, and the V5a epitope, and fusion of Hu-Asp2 to suitable protein partners 
including but not limited to glutathione-S-transferase or maltose binding protein. In a 
preferred embodiment these additional amino acid sequences are added to the 
C-terminus of Hu-Asp but may be added to the N-terminus or at intervening positions 

10 within the Hu-Asp2 polypeptide. 

The present invention also relates to vectors comprising the polynucleotide 
molecules of the invention, as well as host cell transformed with such vectors. Any of 
the polynucleotide molecules of the invention may be joined to a vector, which 
generally includes a selectable marker and an origin of replication, for propagation in 

15 a host. Because the invention also provides Hu-Asp polypeptides expressed from the 
polynucleotide molecules described above, vectors for the expression of Hu-Asp are 
preferred. The vectors include DNA encoding any of the Hu-Asp polypeptides 
described above or below, operably linked to suitable transcriptional or translational 
regulatory sequences, such as those derived from a mammalian, microbial, viral, or 

20 insect gene. Examples of regulatory sequences include transcriptional promoters, 
operators, or enhancers, mRNA ribosomal binding sites, and appropriate sequences 
which control transcription and translation. Nucleotide sequences are operably linked 
when the regulatory sequence functionally relates to the DNA encoding Hu-Asp. 
Thus, a promoter nucleotide sequence is operably linked to a Hu-Asp DNA sequence 

25 if the promoter nucleotide sequence directs the transcription of the Hu-Asp sequence. 

Selection of suitable vectors to be used for the cloning of polynucleotide 
molecules encoding Hu-Asp, or for the expression of Hu-Asp polypeptides, will of 
course depend upon the host cell in which the vector will be transformed, and, where 
applicable, the host cell from which the Hu-Asp polypeptide is to be expressed. 
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Suitable host cells for expression of Hu-Asp polypeptides include prokaryotes, yeast, 
and higher eukaryotic cells, each of which is discussed below. 

The Hu-Asp polypeptides to be expressed in such host cells may also be fusion 
proteins which include regions from heterologous proteins. Such regions may be 
5 included to allow, e.g., secretion, improved stability, or facilitated purification of the 
polypeptide. For example, a sequence encoding an appropriate signal peptide can be 
incorporated into expression vectors. A DNA sequence for a signal peptide (secretory 
leader) may be fused inframe to the Hu-Asp sequence so that Hu-Asp is translated as 
a fusion protein comprising the signal peptide. A signal peptide that is functional in 

10 the intended host cell promotes extracellular secretion of the Hu-Asp polypeptide. 
Preferably, the signal sequence will be cleaved from the Hu-Asp polypeptide upon 
secretion of Hu-Asp from the cell. Nonlimiting examples of signal sequences that can 
be used in practicing the invention include the yeast Ifactor and the honeybee melatin 
leader in sf9 insect cells. 

15 In a preferred embodiment, the Hu-Asp polypeptide will be a fusion protein 

which includes a heterologous region used to facilitate purification of the polypeptide. 
Many of the available peptides used for such a function allow selective binding of the 
fusion protein to a binding partner. For example, the Hu-Asp polypeptide may be 
modified to comprise a peptide to form a fusion protein which specifically binds to a 

20 binding partner, or peptide tag. Nonlimiting examples of such peptide tags include the 
6-His tag, thioredoxin tag, hemaglutinin tag, GST tag, and OmpA signal sequence tag. 
As will be understood by one of skill in the art, the binding partner which recognizes 
and binds to the peptide may be any molecule or compound including metal ions (e.g., 
metal affinity columns), antibodies, or fragments thereof, and any protein or peptide 

25 which binds the peptide, such as the FLAG tag. 

Suitable host cells for expression of Hu-Asp polypeptides includes 
prokaryotes, yeast, and higher eukaryotic cells. Suitable prokaryotic hosts to be used 
for the expression of Hu-Asp include bacteria of the genera Escherichia, Bacillus, and 
Salmonella, as well as members of the genera Pseudomonas, Streptomyces, and 

30 Staphylococcus. For expression in, e.g., E. coli, a Hu-Asp polypeptide may include 
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an N-terminal methionine residue to facilitate expression of the recombinant 
polypeptide in a prokaryotic host. The N-terminal Met may optionally then be 
cleaved from the expressed Hu-Asp polypeptide. Other N-terminal amino acid 
residues can be added to the Hu-Asp polypeptide to facilitate expression in 
5 Escherichia coli including but not limited to the T7 leader sequence, the T7-caspase 8 
leader sequence, as well as others leaders including tags for purification such as the 
6-His tag (Example 9). Hu-Asp polypeptides expressed in E. coli may be shortened 
by removal of the cytoplasmic tail, the transmembrane domain, or the membrane 
proximal region. Hu-Asp polypeptides expressed in E. coli may be obtained in either 

10 a soluble form or as an insoluble form which may or may not be present as an 

inclusion body. The insoluble polypeptide may be rendered soluble by guanidine 
HC1, urea or other protein denaturants, then refolded into a soluble form before or 
after purification by dilution or dialysis into a suitable aqueous buffer. If the inactive 
proform of the Hu-Asp was produced using recombinant methods, it may be rendered 

15 active by cleaving off the prosegment with a second suitable protease such as human 
immunodeficiency virus protease. 

Expression vectors for use in prokaryotic hosts generally comprises one or 
more phenotypic selectable marker genes. Such genes generally encode, e.g., a 
protein that confers antibiotic resistance or that supplies an auxotrophic requirement. 

20 A wide variety of such vectors are readily available from commercial sources. 

Examples include pSPORT vectors, pGEM vectors (Promega), pPROEX vectors 
(LT1, Bethesda, MD), Bluescript vectors (Stratagene), pET vectors (Novagen) and 
pQE vectors (Qiagen). 

Hu-Asp may also be expressed in yeast host cells from genera including 

25 Saccharomyces, Pichia, and Kluveromyces. Preferred yeast hosts are S. cerevisiae 
and P. pastoris. Yeast vectors will often contain an origin of replication sequence 
from a 2T yeast plasmid, an autonomously replicating sequence (ARS), a promoter 
region, sequences for polyadenylation, sequences for transcription termination, and a 
selectable marker gene. Vectors replicable in both yeast and E. coli (termed shuttle 

30 vectors) may also be used. In addition to the above-mentioned features of yeast 
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vectors, a shuttle vector will also include sequences for replication and selection in E. 
coli. Direct secretion of Hu-Asp polypeptides expressed in yeast hosts may be 
accomplished by the inclusion of nucleotide sequence encoding the yeast I-factor 
leader sequence at the 5' end of the Hu- Asp-encoding nucleotide sequence. 
5 Insect host cell culture systems may also be used for the expression of Hu-Asp 

polypeptides. In a preferred embodiment, the Hu-Asp polypeptides of the invention 
are expressed using an insect cell expression system (see Example 10). Additionally, a 
baculovirus expression system can be used for expression in insect cells as reviewed 
by Luckow and Summers, Bio/Technology 6:47 (1988). 

10 In another preferred embodiment, the Hu-Asp polypeptide is expressed in 

• mammalian host cells. Nonlimiting examples of suitable mammalian cell lines 
include the COS7 line of monkey kidney cells (Gluzman et al, Cell 23:175 (1981)), 
human embyonic kidney cell line 293, and Chinese hamster ovary (CHO) cells. 
Preferably, Chinese hamster ovary (CHO) cells are used for expression of Hu-Asp 

1 5 proteins (Example 1 1 ). 

The choice of a suitable expression vector for expression of the Hu-Asp 
polypeptides of the invention will of course depend upon the specific mammalian host 
cell to be used, and is within the skill of the ordinary artisan. Examples of suitable 
expression vectors include pcDNA3 (Invitrogen) and pS VL (Pharmacia Biotech). A 

20 preferred vector for expression of Hu-Asp polypeptides is pcDNA3. 1 -Hygro 
(Invitrogen). Expression vectors for use in mammalian host cells may include 
transcriptional and translational control sequences derived from viral genomes. 
Commonly used promoter sequences and enhancer sequences which may be used in 
the present invention include, but are not limited to, those derived from human 

25 cytomegalovirus (CMV), Adenovirus 2, Polyoma virus, and Simian virus 40 (S V40). 
Methods for the construction of mammalian expression vectors are disclosed, for 
example, in Okayama and Berg (Mol Cell Biol 3:280 (1983)); Cosman et al (Mol 
Immunol 23:935 (1986)); Cosman et al (Nature 312:768 (1984)); EP-A-0367566; 
and WO 91/18982. 
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The polypeptides of the present invention may also be used to raise polyclonal 
and monoclonal antibodies, which are useful in diagnostic assays for detecting 
Hu-Asp polypeptide expression. Such antibodies may be prepared by conventional 
techniques. See, for example, Antibodies: A Laboratory Manual, Harlow and Land 
5 (eds.), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., (1988); 
Monoclonal Antibodies, Hybridomas: A New Dimension in Biological Analyses, 
Kennet et al (eds.), Plenum Press, New York (1980). Synthetic peptides comprising 
portions of Hu-Asp containing 5 to 20 amino acids may also be used for the 
production of polyclonal or monoclonal antibodies after linkage to a suitable carrier 

10 protein including but not limited to keyhole limpet hemacyanin (KLH), chicken 

ovalbumin, or bovine serum albumin using various cross-linking reagents including 
carbodimides, glutaraldehyde, or if the peptide contains a cysteine, 
N-methylmaleimide. A preferred peptide for immunization when conjugated to KLH 
contains the C-terminus of Hu-Asp 1 or Hu-Asp2 comprising 

1 5 QRRPRDPEWNDESSLVRHRWK (SEQ ID NO: 2, residues 497-5 1 8) or 

LRQQHDDFADDISLLK (SEQ ID NO:4, residues 486-501), respectively. See SEQ 
ID Nos. 33-34. 

The Hu-Asp nucleic acid molecules of the present invention are also valuable 
for chromosome identification, as they can hybridize with a specific location on a 

20 human chromosome. Hu-Asp 1 has been localized to chromosome 21, while 

Hu-Asp2 has been localized to chromosome 1 lq23. 3-24.1 . There is a current need for 
identifying particular sites on the chromosome, as few chromosome marking reagents 
based on actual sequence data (repeat polymorphisms) are presently available for 
marking chromosomal location. Once a sequence has been mapped to a precise 

25 chromosomal location, the physical position of the sequence on the chromosome can 
be correlated with genetic map data. The relationship between genes and diseases that 
have been mapped to the same chromosomal region can then be identified through 
linkage analysis, wherein the coinheritance of physically adjacent genes is determined. 
Whether a gene appearing to be related to a particular disease is in fact the cause of 
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the disease can then be determined by comparing the nucleic acid sequence between 
affected and unaffected individuals. 

in another embodiment, the invention relates to a method of assaying Hu-Asp 
function, specifically Hu-Asp2 function which involves incubating in solution the 
5 Hu-Asp polypeptide with a suitable substrate including but not limited to a synthetic 
peptide containing the P-secretase cleavage site of APP, preferably one containing the 
mutation found in a Swedish kindred with inherited AD in which KM is changed to 
NL, such peptide comprising the sequence SEVNLDAEFR in an acidic buffering 
solution, preferably an acidic buffering solution of pH5.5 (see Example 12) using 

10 cleavage of the peptide monitored by high performance liquid chromatography as a 
measure of Hu-Asp proteolytic activity. Preferred assays for proteolytic activity 
utilize internally quenched peptide assay substrates. Such suitable substrates include 
peptides which have attached a paired flurophore and quencher including but not 
limited to 7-amino-4-methyl coumarin and dinitrophenol, respectively, such that 

1 5 cleavage of the peptide by the Hu-Asp results in increased fluorescence due to 

physical separation of the flurophore and quencher. Other paired flurophores and 
quenchers include bodipy-tetramethylrhodamine and QSY-5 (Molecular Probes, Inc.). 
In a variant of this assay, biotin or another suitable tag may be placed on one end of 
the peptide to anchor the peptide to a substrate assay plate and a flurophore may be 

20 placed at the other end of the peptide. Useful flurophores include those listed above 
as well as Europium labels such as W8044 (EG&g Wallac, Inc.). Cleavage of the 
peptide by Asp2 will release the flurophore or other tag from the plate, allowing 
compounds to be assayed for inhibition of Asp2 proteolytic cleavage as shown by an 
increase in retained fluorescence. Preferred colorimetric assays of Hu-Asp proteolytic 

25 activity utilize other suitable substrates that include the P2 and PI amino acids 

comprising the recognition site for cleavage linked to o-nitrophenol through an amide 
linkage, such that cleavage by the Hu-Asp results in an increase in optical density 
after altering the assay buffer to alkaline pH. 
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In another embodiment, the invention relates to a method for the identification 
of an agent that increases the activity of a Hu-Asp polypeptide selected from the group 
consisting of Hu-Asp 1, Hu-Asp2(a), and Hu-Asp2(b), the method comprising 

(a) determining the activity of said Hu-Asp polypeptide in the presence of 
5 a test agent and in the absence of a test agent; and 

(b) comparing the activity of said Hu-Asp polypeptide determined in the 
presence of said test agent to the activity of said Hu-Asp polypeptide determined in 
the absence of said test agent; 

whereby a higher level of activity in the presence of said test agent than in the absence 
10 of said test agent indicates that said test agent has increased the activity of said 

Hu-Asp polypeptide. Such tests can be performed with Hu-Asp polypeptide in a cell 
free system and with cultured cells that express Hu-Asp as well as variants or 
isoforms thereof. 

In another embodiment, the invention relates to a method for the identification 
15 of an agent that decreases the activity of a Hu-Asp polypeptide selected from the 

group consisting of Hu-Asp 1, Hu-Asp2(a), and Hu-Asp2(b), the method comprising 

(a) determining the activity of said Hu-Asp polypeptide in the presence of 
a test agent and in the absence of a test agent; and 

(b) comparing the activity of said Hu-Asp polypeptide determined in the 
20 presence of said test agent to the activity of said Hu-Asp polypeptide determined in 

the absence of said test agent; whereby a lower level of activity in the presence of said 
test agent than in the absence of said test agent indicates that said test agent has 
decreased the activity of said Hu-Asp polypeptide. Such tests can be performed with 
Hu-Asp polypeptide in a cell free system and with cultured cells that express Hu-Asp 

25 as well as variants or isoforms thereof 

In another embodiment, the invention relates to a novel cell line (HEK125.3 
cells) for measuring processing of amyloid P peptide (AP) from the amyloid protein 
precursor (APP). The cells are stable transformants of human embryonic kidney 293 
cells (HEK293) with a bicistronic vector derived from plRES-EGFP (Clontech) 

30 containing a modified human APP cDNA, an internal ribosome entry site and an 
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enhanced green fluorescent protein (EGFP) cDNA in the second cistron. The APP 
cDNA was modified by adding two lysine codons to the carboxyl terminus of the 
APP coding sequence. This increases processing of Ap peptide from human APP by 
2-4 fold. This level of AP peptide processing is 60 fold higher than is seen in 
5 nontransformed HEK293 cells. HEK125.3 cells will be useful for assays of 

compounds that inhibit AP peptide processing. This invention also includes addition 
of two lysine residues to the C-terminus of other APP isoforms including the 751 and 
770 amino acid isoforms, to isoforms of APP having mutations found in human AD 
including the Swedish KM-NL and V717-F mutations, to C-terminal fragments of 

10 APP, such as those beginning with the p-secretase cleavage site, to C-terminal 

fragments of APP containing the p-secretase cleavage site which have been operably 
linked to an N-terminal signal peptide for membrane insertion and secretion, and to 
C-terminal fragments of APP which have been operably linked to an N-terminal 
signal peptide for membrane insertion and secretion and a reporter sequence including 

15 but not limited to green fluorescent protein or alkaline phosphatase, such that 

P-secretase cleavage releases the reporter protein from the surface of cells expressing 
the polypeptide. 

Having generally described the invention, the same will be more 
readily understood by reference to the following examples, which are provided by way 
20 of illustration and are not intended as limiting. 

Example 1 

Development of a Search Algorithm Useful for the 
Identification of Aspartyl Proteases, and Identification 
25 of C. elegans Aspartyl Protease Genes in Wormpep 12 

Materials and Methods: 

Classical aspartyl proteases such as pepsin and renin possess a two-domain structure 
which folds to bring two aspartyl residues into proximity within the active site. These 
are embedded in the short tripeptide motif DTG, or more rarely, DSG. The DTG or 
30 DSG active site motif appears at about residue 25-30 in the enzyme, but at about 
65-70 in the proenzyme (prorenin, pepsinogen). This motif appears again about 



-48- 



WO 01/23533 



PCTYUS00/26080 



1 50-200 residues downstream. The proenzyme is activated by cleavage of the 
N-terminal prodomain. This pattern exemplifies the double domain structure of the 
modem day aspartyl enzymes which apparently arose by gene duplication and 
divergence. Thus; 

5 NH 2 X D 25 TG Y D Y+25 TG C 

where X denotes the beginning of the enzyme, following the N-terminal prodomain, 
and Y denotes the center of the molecule where the gene repeat begins again. 

In the case of the retroviral enzymes such as the HIV protease, they represent 
only a half of the two-domain structures of well-known enzymes like pepsin, 
10 cathepsin D, renin, etc. They have no prosegment, but are carved out of a polyprotein 
precursor containing the gag and pol proteins of the virus. They can be represented 
by: 

NH 2 D 25 TG CI 00 

This "monomer" only has about 1 00 aa, so is extremely parsimonious as compared to 

1 5 the other aspartyl protease "dimers" which have of the order of 330 or so aa, not 
counting the N-terminal prodomain. 

The limited length of the eukaryotic aspartyl protease active site motif makes 
it difficult to search EST collections for novel sequences. EST sequences typically 
average 250 nucleotides, and so in this case would be unlikely to span both aspartyl 

20 protease active site motifs. Instead, we turned to the C. elegans genome. The C. 
elegans genome is estimated to contain around 13,000 genes. Of these, roughly 
12,000 have been sequenced and the corresponding hypothetical open reading frame 
(ORF) has been placed in the database Wormpepl 2. We used this database as the 
basis for a whole genome scan of a higher eukaryote for novel aspartyl proteases, 

25 using an algorithm that we developed specifically for this purpose. The following 
AWK script for locating proteins containing two DTG or DSG motifs was used for 
the search, which was repeated four times to recover all pairwise combinations of the 
aspartyl motif. 

BEGIN {RS=">"} /* defines ">" as record separator for FASTA format */ 

30 { 

pos - index($0,"DTG") /* finds "DTG" in record*/ 
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if (pos>0) { 

rest = substr($0,pos+3) /*get rest of record after first DTG*/ 

pos2 = index(rest,"DTG") /*find second DTG*/ 

if (pos2>0) printf C i %s%s\n"/ , >",$0)} /*report hits*/ 

5 } 
> 

The AWK script shown above was used to search Wormpepl2, which was 

downloaded from ftp.sanger.ac.uk/pub/databases/wormpep, for sequence entries 

containing at least two DTG or DSG motifs. Using AWK limited each record to 3000 
10 characters or less. Thus, 35 or so larger records were eliminated manually from 

Wormpepl2 as in any case these were unlikely to encode aspartyl proteases. 

Results and Discussion: 

The Wormpep 12 database contains 12,178 entries, although some of these 

(<10%) represent alternatively spliced transcripts from the same gene. Estimates of 
1 5 the number of genes encoded in the C. elegans genome is on the order of 1 3,000 

genes, so Wormpep 12 may be estimated to cover greater than 90% of the C. elegans 

genome. 

Eukaryotic aspartyl proteases contain a two-domain structure, probably arising 
from ancestral gene duplication. Each domain contains the active site motif D(S/T)G 

20 located from 20-25 amino acid residues into each domain. The retroviral (e.g., HIV 
protease) or retrotransposon proteases are homodimers of subunits which are 
homologous to a single eukaryotic aspartyl protease domain. An AWK script was 
used to search the Wormpepl2 database for proteins in which the D(S/T)G motif 
occurred at least twice. This identified >60 proteins with two DTG or DSG motifs. 

25 Visual inspection was used to select proteins in which the position of the aspartyl 
domains was suggestive of a two-domain structure meeting the criteria described 
above. 

In addition, the PROSITE eukaryotic and viral aspartyl protease active site 
pattern PS00141 was used to search Wormpep 12 for candidate aspartyl proteases. 
30 (Bairoch A., Bucher P., Hofinann K., The PROSITE database: its status in 1997, 
Nucleic Acids Res. 24:2) 7-221(1997)). This generated an overlapping set of 
Wormpepl2 sequences. Of these, seven sequences contained two DTG or DSG 
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motifs and the PROSITE aspartyl protease active site pattern. Of these seven, three 
were found in the same cosmid clone (F21F8.3, F21F8.4, and F21F8.7) suggesting 
that they represent a family of proteins that arose by ancestral gene duplication. Two 
other ORFs with extensive homology to F21F8.3, F21F8.4 and F21F8.7 are present in 
5 the same gene cluster (F21F8.2 and F21F8.6), however, these contain only a single 
DTG motif. Exhaustive BLAST searches with these seven sequences against 
Wormpepl2 failed to reveal additional candidate aspartyl proteases in the C. elegans 
genome containing two repeats of the DTG or DSG motif. 

BLASTX search with each C. elegans sequence against SWISS-PROT, 
10 GenPep and TREMBL revealed that Rl 2H7.2 was the closest worm homologue to the 
known mammalian aspartyl proteases, and that T18H9.2 was somewhat more 
distantly related, while CEASP1, F21F8.3, F21F8.4, and F21F8.7 formed a subcluster 
which had the least sequence homology to the mammalian sequences. 
Discussion: 

15 APP, the presenilins, and p35, the activator of cdk5, all undergo intracellular 

proteolytic processing at sites which conform to the substrate specificity of the HIV 
protease. Dysregulation of a cellular aspartyl protease with the same substrate 
specificity, might therefore provide a unifying mechanism for causation of the plaque 
and tangle pathologies in AD. Therefore, we sought to identify novel human aspartyl 

20 proteases. A whole genome scan in C. elegans identified seven open reading frames 
that adhere to the aspartyl protease profile that we had identified. These seven 
aspartyl proteases probably comprise the complete complement of such proteases in a 
simple, multicellular eukaryote. These include four closely related aspartyl proteases 
unique to C. elegans which probably arose by duplication of an ancestral gene. The 

25 other three candidate aspartyl proteases (T18H9.2, R12H7.2 and CI 1D2.2) were 
found to have homology to mammalian gene sequences. 



30 
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Example 2 

Identification of Novel Human Aspartyl 
Proteases Using Database Mining by Genome Bridging 

5 Materials and Methods: 

Computer-assisted analysis of EST databases, cDNA , and predicted polypeptide 
sequences: 

Exhaustive homology searches of EST databases with the CEASP1, F21F8.3, 
F21F8.4, and F21F8.7 sequences failed to reveal any novel mammalian homologues. 

10 TBLASTN searches with R12H7.2 showed homology to cathepsin D, cathepsin E, 

pepsinogen A, pepsinogen C and renin, particularly around the DTG motif within the 
active site, but also failed to identify any additional novel mammalian aspartyl 
proteases. This indicates that the C. elegans genome probably contains only a single 
lysosomal aspartyl protease which in mammals is represented by a gene family that 

15 arose through duplication and consequent modification of an ancestral gene. 

TBLASTN searches with T18H9.2, the remaining C. elegans sequence, 
identified several ESTs which assembled into a contig encoding a novel human 
aspartyl protease (Hu-ASPl). As is described above in Example 1, BLASTX search 
with the Hu-ASPl contig against SWISS-PROT revealed that the active site motifs in 

20 the sequence aligned with the active sites of other aspartyl proteases. Exhaustive, 
repetitive rounds of BLASTN searches against LifeSeq, LifeSeqFL, and the public 
EST collections identified 102 EST from multiple cDNA libraries that assembled into 
a single contig. The 51 sequences in this contig found in public EST collections also 
have been assembled into a single contig (THC2 13329) by The Institute for Genome 

25 Research (TIGR). The TIGR annotation indicates that they failed to find any hits in 
the database for the contig. Note that the TIGR contig is the reverse complement of 
the LifeSeq contig that we assembled. BLASTN search of Hu-ASPl against the rat 
and mouse EST sequences in ZooSeq revealed one homologous EST in each database 
(Incyte clone 70031 1523 and IMAGE clone 313341, GenBank accession number 

30 W 1 0530, respectively). 
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TBLASTN searches with the assembled DNA sequence for Hu-ASPl against 
both LifeSeqFL and the public EST databases identified a second, related human 
sequence (Hu-Asp2) represented by a single EST (2696295). Translation of this 
partial cDNA sequence reveals a single DTG motif which has homology to the active 
5 site motif of a bovine aspartyl protease, NM1. 

BLAST searches, contig assemblies and multiple sequence alignments were 
performed using the bioinformatics tools provided with the LifeSeq, LifeSeqFL and 
LifeSeq Assembled databases from Incyte. Predicted protein motifs were identified 
using either the ProSite dictionary (Motifs in GCG 9) or the Pfam database. 

1 0 Full-length cDNA cloning of Hu-Aspl 

The open reading frame of C. elegans gene T18H9.2CE was used to query 
Incyte LifeSeq and LifeSeq-FL databases and a single electronic assembly referred to 
as 1863920CE1 was detected. The 5' most cDNA clone in this contig, 1863920, was 
obtained from Incyte and completely sequenced on both strands. Translation of the 
. 15 open reading frame contained within clone 1 863920 revealed the presence of the 
duplicated aspartyl protease active site motif (DTG/DSG) but the 5' end was 
incomplete. The remainder of the Hu-Aspl coding sequence was determined by 5' 
Marathon RACE analysis using a human placenta Marathon ready cDNA template 
(Clontech). A 3'-antisense oligonucleotide primer specific for the 5' end of clone 

20 1863920 was paired with the S'-sense primer specific for the Marathon ready cDNA 
synthetic adaptor in the PCR. Specific PCR products were directly sequenced by 
cycle sequencing and the resulting sequence assembled with the sequence of clone 
1 863920 to yield the complete coding sequence of Hu-Asp-1 (SEQ ID No. 1 ). 

Several interesting features are present in the primary amino acid sequence of 

25 Hu-Aspl (Figure 1, SEQ ID No. 2). The sequence contains a signal peptide (residues 
1-20 in SEQ ID No. 2), a pro-segment, and a catalytic domain containing two copies 
of the aspartyl protease active site motif (DTG/DSG). The spacing between the first 
and second active site motifs is about 200 residues which should correspond to the 
expected size of a single, eukaryotic aspartyl protease domain. More interestingly, the 

30 sequence contains a predicted transmembrane domain (residues 469-492 in SEQ ID 
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No. 2) near its C-terminus which suggests that the protease is anchored in the 
membrane. This feature is not found in any other aspartyl protease. 

Cloning of a full-length Hu-Asp-2 cDNAs: 
5 As is described above in Example 1 , genome wide scan of the Caenorhabditis 

elegans database WormPepl2 for putative aspartyl proteases and subsequent mining 
of human EST databases revealed a human ortholog to the G elegans gene T18H9.2 
referred to as Hu-Aspl . The assembled contig for Hu-Aspl was used to query for 
human paralogs using the BLAST search tool in human EST databases and a single 

10 significant match (2696295CE1) with approximately 60% shared identity was found 
in the LifeSeq FL database. Similar queries of either gbl05PubEST or the family of 
human databases available from TIGR did not identify similar EST clones. cDNA 
clone 2696295, identified by single pass sequence analysis from a human uterus 
cDNA library, was obtained from Incyte and completely sequence on both strands. 

15 This clone contained an incomplete 1266 bp open-reading frame that encoded a 422 
amino acid polypeptide but lacked an initiator ATG on the 5' end. Inspection of the 
predicted sequence revealed the presence of the duplicated aspartyl protease active 
site motif DTG/DSG, separated by 194 amino acid residues. Subsequent queries of 
later releases of the LifeSeq EST database identified an additional ESTs, sequenced 

20 from a human astrocyte cDNA library (4386993), that appeared to contain additional 
5' sequence relative to clone 2696295. Clone 4386993 was obtained from Incyte and 
completely sequenced on both strands. Comparative analysis of clone 4386993 and 
clone 2696295 confirmed that clone 4386993 extended the open-reading frame by 31 
amino acid residues including two in-frame translation initiation codons. Despite the 

25 presence of the two in-frame ATGs, no in-frame stop codon was observed upstream 
of the ATG indicating that the 4386993 may not be full-length. Furthermore, 
alignment of the sequences of clones 2696295 and 4386993 revealed a 75 base pair 
insertion in clone 2696295 relative to clone 4386993 that results in the insertion of 25 
additional amino acid residues in 2696295. The remainder of the Hu-Asp2 coding 

30 sequence was determined by 5' Marathon RACE analysis using a human 
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hippocampus Marathon ready cDNA template (Clontech). A 3*-antisense 
oligonucleotide primer specific for the shared 5'-region of clones 2696295 and 
4386993 was paired with the 5 '-sense primer specific for the Marathon ready cDNA 
synthetic adaptor in the PCR. Specific PCR products were directly sequenced by 
5 cycle sequencing and the resulting sequence assembled with the sequence of clones 

2696295 and 4386993 to yield the complete coding sequence of Hu-Asp2(a) (SEQ ID 
No. 3) and Hu-Asp2(b) (SEQ ID No. 5), respectively. 

Several interesting features are present in the primary amino acid sequence of 
Hu-Asp2(a) (Figure 2 and SEQ ID No. 4) and Hu-Asp-2(b) (Figure 3, SEQ ID No. 6). 

10 Both sequences contain a signal peptide (residues 1-21 in SEQ ID No. 4 and SEQ ID 
No. 6), a pro-segment, and a catalytic domain containing two copies of the aspartyl 
protease active site motif (DTG/DSG). The spacing between the first and second 
active site motifs is variable due to the 25 amino acid residue deletion in Hu-Asp-2(b) 
and consists of 168-ver.sw.s-194 amino acid residues, for Hu-Asp2(b) and 

15 Hu-Asp-2(a), respectively. More interestingly, both sequences contains a predicted 
transmembrane domain (residues 455-477 in SEQ ID No.4 and 430-452 in SEQ ID 
No. 6) near their C-termini which indicates that the protease is anchored in the 
membrane. This feature is not found in any other aspartyl protease except Hu-Aspl . 



20 Example 3 

Molecular cloning of mouse Asp2 cDNA and genomic DNA. 

Cloning and characterization of murine Asp2 cDNA. 

The murine ortholog of Hu-Asp2 was cloned using a combination of cDNA 
library screening, PCR, and genomic cloning. Approximately 500,000 independent 

25 clones from a mouse brain cDNA library were screened using a 32 P-labeled coding 

sequence probe prepared from Hu-Asp2. Replicate positives were subjected to DNA 
sequence analysis and the longest cDNA contained the entire 3' untranslated region 
and 47 amino acids in the coding region. PCR amplification of the same mouse brain 
cDNA library with an antisense oligonucleotide primer specific for the 5 '-most 

30 cDNA sequence determined above and a sense primer specific for the 5' region of 
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human Asp2 sequence followed by DNA sequence analysis gave an additional 980 bp 
of the coding sequence. The remainder of the 5' sequence of murine Asp-2 was 
derived from genomic sequence (see below). 

5 Isolation and sequence analysis of the murine Asp-2 gene. 

A murine EST sequence encoding a portion of the murine Asp2 cDNA was 
identified in the GenBank EST database using the BLAST search tool and the 
Hu-Asp2 coding sequence as the query. Clone g3 160898 displayed 88% shared 
identity to the human sequence over 352 bp. Oligonucleotide primer pairs specific for 

10 this region of murine Asp2 were then synthesized and used to amplify regions of the 
murine gene. Murine genomic DNA, derived from strain 129/SvJ, was amplified in 
the PCR (25 cycles) using various primer sets specific for murine Asp2 and the 
products analyzed by agarose gel electrophoresis. The primer set Zoo-1 and Zoo-4 
amplified a 750 bp fragment that contained approximately 600 bp of intron sequence 

15 based on comparison to the known cDNA sequence. This primer set was then used to 
screen a murine BAC library by PCR, a single genomic clone was isolated and this 
cloned was confirmed contain the murine Asp2 gene by DNA sequence analysis. 
Shotgun DNA sequencing of this Asp2 genomic clone and comparison to the cDNA 
sequences of both Hu-Asp2 and the partial murine cDNA sequences defined the 

20 full-length sequence of murine Asp2 (SEQ ID No. 7). The predicted amino acid 
sequence of murine Asp2 (SEQ ID No. 8) showed 96.4% shared identity (GCG 
BestFit algorithm) with 18/501 amino acid residue substitutions compared to the 
human sequence (Figure 4). The proteolytic processing of murine Asp2(a) is believed 
to be analogous to the processing described above for human Asp2(a). In addition, a 

25 variant lacking amino acid residues 190-214 of SEQ ID NO: 8 is specifically 

contemplated as a murine Asp2(b) polypeptide. All forms of murine Asp2(b) gene 
and protein are intended as aspects of the invention. 



30 
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Example 4 

Tissue Distribution of Expression of Hu-Asp2 Transcripts 

Materials and Methods: 

The tissue distribution of expression of Hu-Asp-2 was determined using 
5 multiple tissue Northern blots obtained from Clontech (Palo Alto, CA). Incyte clone 
2696295 in the vector pINCY was digested to completion with EcoRUNotl and the 1.8 
kb cDNA insert purified by preparative agarose gel electrophoresis. This fragment 
was radiolabeled to a specific activity > 1 X 10 9 dpm/fig by random priming in the 
presence of [a- 32 P-dATP] (>3000 Ci/mmol, Amersham, Arlington Heights, IL) and 

10 Klenow fragment of DNA polymerase I. Nylon filters containing denatured, size 
fractionated poly A + RNAs isolated from different human tissues were hybridized 
with 2 x 10 6 dpm/ml probe in ExpressHyb buffer (Clontech, Palo Alto, CA) for 1 hour 
at 68 °C and washed as recommended by the manufacture. Hybridization signals were 
visualized by autoradiography using BioMax XR film (Kodak, Rochester, NY) with 

15 intensifying screens at -80 °C. 

Results and Discussion: 

Limited information on the tissue distribution of expression of Hu-Asp-2 
transcripts was obtained from database analysis due to the relatively small number of 

20 ESTs detected using the methods described above (< 5). In an effort to gain further 
information on the expression of the Hu-Asp2 gene, Northern analysis was employed 
to determine both the size(s) and abundance of Hu-Asp2 transcripts. PolyA* RNAs 
isolated from a series of peripheral tissues and brain regions were displayed on a solid 
support following separation under denaturing conditions and Hu-Asp2 transcripts 

25 were visualized by high stringency hybridization to radiolabeled insert from clone 
2696295. The 2696295 cDNA probe visualized a constellation of transcripts that 
migrated with apparent sizes of 3.0kb, 4.4 kb and 8.0 kb with the latter two transcript 
being the most abundant. 

Across the tissues surveyed, Hu-Asp2 transcripts were most abundant in 

30 pancreas and brain with lower but detectable levels observed in all other tissues 
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examined except thymus and PBLs. Given the relative abundance of Hu-Asp2 
transcripts in brain, the regional expression in brain regions was also established. A 
similar constellation of transcript sizes were detected in all brain regions examined 
[cerebellum, cerebral cortex, occipital pole, frontal lobe, temporal lobe and putamen] 
5 with the highest abundance in the medulla and spinal cord. 

Example 5 

Northern Blot Detection of HuAsp-1 and 
HuAsp-2 Transcripts in Human Cell Lines 

10 A variety of human cell lines were tested for their ability to produce Hu-Aspl 

and Asp2 mRNA. Human embryonic kidney (HEK-293) cells, African green monkey 
(Cos-7) cells, Chinese hamster ovary (CHO) cells, HELA cells, and the 
neuroblastoma cell line 1MR-32 were all obtained from the ATCC. Cells were 
cultured in DME containing 10% FCS except CHO cells which were maintained in 

15 a-MEM/10% FCS at 37 °C in 5% C0 2 until they were near confluence. Washed 
monolayers of cells (3 X 10 7 ) were lysed on the dishes and poly A + RNA extracted 
using the Qiagen Oligotex Direct mRNA kit. Samples containing 2 p.g of poly A + 
RNA from each cell line were fractionated under denaturing conditions 
(glyoxal-treated), transferred to a solid nylon membrane support by capillary action, 

20 . and transcripts visualized by hybridization with random-primed labeled ( 32 P) coding 
sequence probes derived from either Hu-Aspl or Hu-Asp2. Radioactive signals were 
detected by exposure to X-ray film and by image analysis with a Phosphorlmager. 

The Hu-Aspl cDNA probe visualized a similar constellation of transcripts (2.6 
kb and 3.5 kb) that were previously detected is human tissues. The relative abundance 

25 determined by quantification of the radioactive signal was Cos-7 > HEK 292 = HELA 
> IMR32. 

The Hu-Asp2 cDNA probe also visualized a similar constellation of transcripts 
compared to tissue (3.0 kb, 4.4 kb, and 8.0 kb) with the following relative abundance; 
HEK 293 > Cos 7 > IMR32 > HELA. 

30 
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Example 6 

Modification of APP to increase Ap processing for in vitro screening 

Human cell lines that process AP peptide from APP provide a means to screen 
in cellular assays for inhibitors of P- and y-secretase. Production and release of Ap 
5 peptide into the culture supernatant is monitored by an enzyme-linked immunosorbent 
assay (E1A). Although expression of APP is widespread and both neural and 
non-neuronal cell lines process and release Ap peptide, levels of endogenous APP 
processing are low and difficult to detect by E1A. Ap processing can be increased by 
expressing in transformed cell lines mutations of APP that enhance Ap processing. 
10 We made the serendipitous observation that addition of two lysine residues to the 

carboxyl terminus of APP695 increases Ap processing still further. This allowed us 
to create a transformed cell line that releases Ap peptide into the culture medium at 
the remarkable level of 20,000 pg/ml. 

1 5 Materials And Methods 

Materials: 

Human embryonic kidney cell line 293 (HEK293 cells) were obtained 
internally. The vector pERES-EGFP was purchased from Clontech. Oligonucleotides 
for mutation using the polymerase chain reaction (PCR) were purchased from 
20 Genosys. A plasmid containing human APP695 (SEQ ID No. 9 [nucleotide] and SEQ 
ID No. 10 [amino acid]) was obtained from Northwestern University Medical School. 
This was subcloned into pSK (Stratagene) at the Not\ site creating the plasmid 
pAPP695. 

Mutagenesis protocol: 

25 The Swedish mutation (K670N, M671 L) was introduced into pAPP695 using 

the Stratagene Quick Change Mutagenesis Kit to create the plasmid pAPP695NL 
(SEQ ID No. 1 1 [nucleotide] and SEQ ID No. 12 [amino acid]). To introduce a 
di-lysine motif at the C-terminus of APP695, the forward primer #276 5' 
GACTGACCACTCGACCAGGTTC (SEQ ID No. 47) was used with the "patch" 

30 primer #274 5' 
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CGAATTAAATTCCAGCACACTGGCTACTTCTTGTTCTGCATCTCAAAGAAC 
(SEQ ID No. 48) and the flanking primer #275 

CGAATTAAATTCCAGCACACTGGCTA (SEQ ID No. 49) to modify the 3' end of 
the APP695 cDNA (SEQ ID No. 15 [nucleotide] and SEQ ID No. 16 [amino acid]). 
5 This also added a BstXl restriction site that will be compatible with the BstXl site in 
the multiple cloning site of pIRES-EGFP. PCR amplification was performed with a 
Clontech HF Advantage cDNA PCR kit using the polymerase mix and buffers 
supplied by the manufacturer. For "patch" PCR, the patch primer was used at l/20th 
the molar concentration of the flanking primers. PCR amplification products were 

10 purified using a QIAquick PCR purification kit (Qiagen). After digestion with 

restriction enzymes, products were separated on 0.8% agarose gels and then excised 
DNA fragments were purified using a QIAquick gel extraction kit (Qiagen). 

To reassemble a modified APP695-Sw cDNA, the 5' Notl-Bgl2 fragment of 
the APP695-Sw cDNA and the 3' Bgl2-BstXl APP695 cDNA fragment obtained by 

1 5 PCR were ligated into pIRES-EGFP plasmid DNA opened at the Notl and BstX 1 
sites. Ligations were performed for 5 minutes at room temperature using a Rapid 
DNA Ligation kit (Boehringer Mannheim) and transformed into Library Efficiency 
DH5a Competent Cells (GibcoBRL Life Technologies). Bacterial colonies were 
screened for inserts by PCR amplification using primers #276 and #275. Plasmid 

20 DNA was purified for mammalian cell transfection' using a QIAprep Spin Miniprep 
kit (Qiagen). The construct obtained was designated pMG 125.3 (APPSW-KK, SEQ 
ID No. 17 [nucleotide] and SEQ ID No. 18 [amino acid]). 
Mammalian Cell Transfection: 

HEK293 cells for transfection were grown to 80% confluence in Dulbecco's 

25 modified Eagle's medium (DMEM) with 10% fetal bovine serum. Cotransfections 

were performed using LipofectAmine (Gibco-BRL) with 3 \xg pMG 125.3 DNA and 9 
jig pcDNA3.1 DNA per 10 x 10 6 cells. Three days posttransfection, cells were 
passaged into medium containing G418 at a concentration of 400 (ig/ml. After three 
days growth in selective medium, cells were sorted by their fluorescence. 

30 Clonal Selection of 125.3 cells by FACS: 
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Cell samples were analyzed on an EPICS Elite ESP flow cytometer (Coulter, 
Hialeah, FL) equipped with a 488 nm excitation line supplied by an air-cooled argon 
laser. EGFP emission was measured through a 525 nm band-pass filter and 
fluorescence intensity was displayed on a 4-decade log scale after gating on viable 
5 cells as determined by forward and right angle light scatter. Single green cells were 
separated into each well of one 96 well plate containing growth medium without 
G41 8. After a four day recovery period, G41 8 was added to the medium to a final 
concentration of 400 ^g/ml. After selection, 32% of the wells contained expanding 
clones. Wells with clones were expanded from the 96 well plate to a 24 well plate 
10 and then a 6 well plate with the fastest growing colonies chosen for expansion at each 
passage. The final cell line selected was the fastest growing of the final six passaged. 
This clone, designated 125.3, has been maintained in G418 at 400 ug/ml with passage 
every four days into fresh medium. No loss of Ap production of EGFP fluorescence 
has been seen over 23 passages. 

15 

AfiEIA Analysis (Double Antibody Sandwich ELISA for hAfi 1-40/42): 

Cell culture supematants harvested 48 hours after transfection were analyzed 
in a standard Ap EIA as follows. Human Ap 1-40 or 1-42 was measured using 
monoclonal antibody (mAb) 6E10 (Senetek, St. Louis, MO) and biotinylated rabbit 

20 antiserum 162 or 164 (New York State Institute for Basic Research, Staten Island, 

NY) in a double antibody sandwich ELISA. The capture antibody 6E10 is specific to 
an epitope present on the N-terminal amino acid residues 1-16 of hAp. The 
conjugated detecting antibodies 162 and 164 are specific for hAp 1-40 and 1-42, 
respectively. Briefly, a Nunc Maxisorp 96 well immunoplate was coated with 100 

25 p.l/well of mAb 6E1 0 (5p.g/ml) diluted in 0.1 M carbonate-bicarbonate buffer, pH 9.6 
and incubated at 4°C overnight. After washing the plate 3x with 0.0 1M DPBS 
(Modified Dulbecco's Phosphate Buffered Saline (0.008M sodium phosphate, 
0.002M potassium phosphate, 0.14M sodium chloride, 0.01 M potassium chloride, pH 
7.4) from Pierce, Rockford, II) containing 0.05% of Tween-20 (DPBST), the plate 

30 was blocked for 60 minutes with 200 \i\ of 10% normal sheep serum (Sigma) in 
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0.01M DPBS to avoid non-specific binding. Human Ap 1-40 or 1-42 standards 100 
(al/well (Bachem, Torrance, CA) diluted, from a lmg/ml stock solution in DMSO, in 
culture medium was added after washing the plate, as well as 100 ^il/well of sample, 
e.g., conditioned medium of transfected cells. 
5 The plate was incubated for 2 hours at room temperature and 4°C overnight. 

The next day, after washing the plate, 100 ^1/well biotinylated rabbit antiserum 162 
1 :400 or 164 1 :50 diluted in DPBST + 0.5% BSA was added and incubated at room 
temperature for lhour, 15 minutes. Following washes, 100 |il/well 
neutravidin-horseradish peroxidase (Pierce, Rockford, II) diluted 1:10,000 in DPBST 

10 was applied and incubated for 1 hour at room temperature. After the last washes 100 
[il/well of o-phenylnediamine dihydrochloride (Sigma Chemicals, St. Louis, MO) in 
50mM citric acid/1 OOmM sodium phosphate buffer (Sigma Chemicals, St. Louis, 
MO), pH 5.0, was added as substrate and the color development was monitored at 
450nm in a kinetic microplate reader for 20 minutes using Soft max Pro software. All 

1 5 standards and samples were run in triplicates. The samples with absorbance values 
falling within the standard curve were extrapolated from the standard curves using 
Soft max Pro software and expressed in pg/ml culture medium. 
Results: 

Addition of two lysine residues to the carboxyl terminus of APP695 greatly 
20 increases Ap processing in HEK293 cells as shown by transient expression (Table 1 ). 
Addition of the di-lysine motif to APP695 increases Ap processing to that seen with 
the APP695 containing the Swedish mutation. Combining the di-lysine motif with the 
Swedish mutation further increases processing by an additional 2.8 fold. 

Cotransformation of HEK293 cells with pMG125.3 and pcDNA3.1 allowed 
25 dual selection of transformed cells for G41 8 resistance and high level expression of 
EGFP. After clonal selection by FACS, the cell line obtained, produces a remarkable 
20,000 pg Ap peptide per ml of culture medium after growth for 36 hours in 24 well 
plates. Production of Ap peptide under various growth conditions is summarized in 
Table 2. 

30 
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TABLE 1 

Release of Ap peptide into the culture medium 48 hours after transient 
transfection of HEK293 cells with the indicated vectors containing wildtype or 
modified APP. Values tabulated are mean + SD and P-value for pairwise comparison 
5 using Student's t-test assuming unequal variances. 



APP Construct 


Ap 1-40 peptide 
(pg/ml) 


Fold Increase 


P-value 


pIRES-EGFP vector 


147 + 28 


1.0 




wt APP695 (142.3) 


194+15 


1.3 


0.051 


wt APP695-KK( 124.1) 


424 + 34 


2.8 


3 x 10-5 


APP695-Sw (143.3) 


457 + 65 


3.1 


2 x 10-3 


APP695-SwKK (125.3) 


1308 + 98 


8.9 


3 x 10-4 



20 



25 
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TABLE 2 

Release of Ap peptide from HEK 125.3 cells under various growth conditions. 



Type of Culture 


Volume of 


Duration of 


AP 1-40 


AP 1-42 


Plate 


Medium 


Culture 


(pg/fnl) 


(pg/ml) 


24 well plate 


400 ul 


36 hr 


28,036 


1,439 



10 

Example 7 

Antisense oligomer inhibition of Abeta processing in HEK125.3 cells 

The sequences of Hu-Aspl and Hu-Asp2 were provided to Sequitur, Inc 

15 (Natick, MA) for selection of targeted sequences and design of 2nd generation 
chimeric antisense oligomers using prorietary technology (Sequitur Ver. D Pat 
pending #3002). Antisense oligomers Lot# S644, S645, S646 and S647 were targeted 
against Aspl. Antisense oligomers Lot# S648, S649, S650 and S651 were targeted 
against Asp2. Control antisense oligomers Lot# S652, S653, S655, and S674 were 

20 targeted against an irrelevant gene and antisense oligomers Lot #S656, S657, S658, 
and S659 were targeted against a second irrelevant gene. 

For transfection with the antisense oligomers, HEK125.3 cells were grown to 
about 50% confluence in 6 well plates in Minimal Essential Medium (MEM) 
supplemented with 10% fetal calf serum. A stock solution of oligofectin G (Sequitur 

25 Inc., Natick, MA) at 2 mg/ml was diluted to 50 |ig/ml in serum free MEM. 

Separately, the antisense oligomer stock solution at 100 p.M was diluted to 800 nM in 
Opti-MEM (GIBCO-BRL, Grand Island, NY). The diluted stocks of oligofectin G 
and antisense oligomer were then mixed at a ratio of 1 : 1 and incubated at room 
temperature. After 15 minutes incubation, the reagent was diluted 10 fold into MEM 

30 containing 10% fetal calf serum and 2 ml was added to each well of the 6 well plate 
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after first removing the old medium. After transfection, cells were grown in the 
continual presence of the oligofectin G/antisense oligomer. To monitor AP peptide 
release, 400 \i\ of conditioned medium was removed periodically from the culture 
well and replaced with fresh medium beginning 24 hours after transfection. AP 
5 peptides in the conditioned medium were assayed via immunoprecipitation and 

Western blotting. Data reported are from culture supernatants harvested 48 hours 
after transfection. 

The 16 different antisense oligomers obtained from Sequitur Inc. were 
transfected separately into HEK125.3 cells to determine their affect on AP peptide 

10 processing. Only antisense oligomers targeted against Asp2 significantly reduced 

Abeta processing by HEK125.3 cells. Both Ap (1-40) and Ap (1-42) were inhibited 
by the same degree. In Table 3, percent inhibition is calculated with respect to 
untransfected cells. Antisense oligomer reagents giving greater than 50% inhibition 
are marked with an asterisk. Of the reagents tested, 3 or 4 antisense oligomers targeted 

15 against Aspl gave an average 52% inhibition of AP(l-40) processing and 47% 

inhibition of AP(l-42) processing. For Asp2, 4 of 4 antisense oligomers gave greater 
than 50% inhibition with an average inhibition of 62% of AP(l-40) processing and 
60% for AP(l-42) processing. 

20 



25 



30 
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TABLE 3 

Inhibition of Ap peptide release from HEK125.3 cells treated with antisense 
oligomers. 

5 



Gene Targeted 


Antisense Oligomer 


Abeta(l-40) 


Abeta(l-42) 


Aspl-1 


S644 


62%* 


56%* 


Asp 1-2 


S645 


41%* 


38%* 


Asp 1-3 


S646 


52%* 


46%* 


Asp 1-4 


S647 


6% 


25%* 


Asp2-1 


S648 


71%* 


67%* 


Asp2-2 


S649 


83%* 


76%* 


Asp2-3 


S650 


46%* 


50%* 


Asp2-4 


S651 


47%* 


46%* 


Conl-1 


S652 


13% 


18% 


Con 1-2 


S653 


35% 


30% 


Con 1-3 


S655 


9% 


18% 


Con 1-4 


S674 


29% 


18% 


Con2-l 


S656 


12% 


18% 


Con2-2 


S657 


16% 


19% 


Con2-3 


S658 


8% 


35% 


Con2-4 


S659 


3% 


18% 



Since HEK293 cells derive from kidney, the experiment was extended to 
25 human DvlR-32 neuroblastoma cells which express all three APP isoforms and which 
release AP peptides into conditioned medium at measurable levels. [See Neill et al, 
J. NeuroSci. Res., (1994) 39: 482-93; and Asami-Odaka et aL t Biochem., (1995) 
34:10272-8.] Essentially identical results were obtained in the neuroblastoma cells as 
the HEK293 cells. As shown in Table 3B, the pair of Asp2 antisense oligomers 
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reduced Asp2 mRNA by roughly one-half, while the pair of reverse control oligomers 
lacked this effect (Table 3B). 

Table 3B 

5 Reduction of Ap40 and Ap42 in human neuroblastoma IMR-32 cells and mouse 

neuroblastoma Neuro-2A cells treated with Asp2 antisense and control oligomers as 
indicated. Oligomers were transfected in quadruplicate cultures. Values tabulated are 
normalized against cultures treated with oligofectin-G™ only (mean + SD, ** _ 
p<0.001 compared to reverse control oligomer). 
10 k 





IMR-32 cells 


Neuro-2A cells 




Asp2 
mRNA 


Ap40 


AP42 


AP40 


AP42 


Asp2-1A 


-75% 


-49 + 2%** 


-42 + 
14%** 


-70 + 
7%** 


-67 + 2%** 


Asp2-1R 


0.16 


-0 + 3% 


21.26 


-9+ 15% 


1.05 


Asp2-2A 


-39% 


-43 + 3%** 


-44+18%** 


-61 

+12%** 


-61 +12%** 


Asp2-2R 


0.47 


12.2 


19.22 


6.15 


-8 + 10% 



Together with the reduction in Asp2 mRNA there was a concomitant reduction in the 
release of AP40 and Ap42 peptides into the conditioned medium. Thus, Asp2 

20 functions directly or indirectly in a human kidney and a human neuroblastoma cell 
line to facilitate the processing of APP into Ap peptides. Molecular cloning of the 
mouse Asp2 cDNA revealed a high degree of homology to human (>96% amino acid 
identity, see Example 3), and indeed, complete nucleotide identity at the sites targeted 
by the Asp2-1 A and Asp2-2A antisense oligomers. Similar results were obtained in 

25 mouse Neuro-2a cells engineered to express APP-Sw-KK. The Asp2 antisense 

oligomers reduced release of Ap peptides into the medium while the reverse control 
oligomers did not (Table 3B). Thus, the three antisense experiments with HEK293, 
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1MR-32 and Neuro-2a cells indicate that Asp2 acts directly or indirectly to facilitate 
AP processing in both somatic and neural cell lines. 

Example 8 

5 Demonstration of Hu-Asp2 P-Secretase Activity in Cultured Cells 

Several mutations in APP associated with early onset Alzheimer's disease 
have been shown to alter AP peptide processing. These flank the - and C-terminal 
cleavage sites that release AP from APP. These cleavage sites are referred to as the 
P-secretase and y-secretase cleavage sites, respectively. Cleavage of APP at the 

10 P-secretase site creates a C-terminal fragment of APP containing 99 amino acids of 
1 1,145 daltons molecular weight. The Swedish KM-NL mutation immediately 
upstream of the p-secretase cleavage site causes a general increase in production of 
both the 1-40 and 1-42 amino acid forms of Ap peptide. The London VF mutation 
(V717-F in the APP770 isoform) has little effect on total AP peptide production, but 

15 appears to preferentially increase the percentage of the longer 1-42 amino acid form of 
Ap peptide by affecting the choice of P-secretase cleavage site used during APP 
processing. Thus, we sought to determine if these mutations altered the amount and 
type of AP peptide produced by cultured cells cotransfected with a construct directing 
expression of Hu-Asp2. 

20 Two experiments were performed which demonstrate Hu-Asp2 P-secretase 

activity in cultured cells. In the first experiment, treatment of HEK125.3 cells with 
antisense oligomers directed against Hu-Asp2 transcripts as described in Example 7 
was found to decrease the amount of the C-terminal fragment of APP created by 
p-secretase cleavage (CTF99) (Figure 9). This shows that Hu-Asp2 acts directly or 

25 indirectly to facilitate P-secretase cleavage. In the second experiment, increased 
expression of Hu-Asp2 in transfected mouse Neuro2A cells is shown to increase 
accumulation of the CTF99 P-secretase cleavage fragment (Figure 10). This increase 
is seen most easily when a mutant APP-KK clone containing a C-terminal di-lysine 
motif is used for transfection. A further increase is seen when Hu-Asp2 is 
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cotransfected with APP-Sw-KK containing the Swedish mutation KM -NL. The 
. Swedish mutation is known to increase cleavage of APP by the p-secretase. 

A second set of experiments demonstrate Hu-Asp2 facilitates y-secretase 
activity in cotransfection experiments with human embryonic kidney HEK293 cells. 
5 Cotransfection of Hu-Asp2 with an APP-KK clone greatly increases production and 
release of soluble Api-40 and Api-42 peptides from HEK293 cells. There is a 
proportionately greater increase in the release of Apl-42. A further increase in 
production of Ap 1-42 is seen when Hu-Asp2 is cotransfected with APP-VF (SEQ ID 
No. 13 [nucleotide] and SEQ ID No. 14 [amino acid]) or APP-VF-KK SEQ ID No. 19 

10 [nucleotide] and SEQ ID No. 20 [amino acid]) clones containing the London mutation 
V717^F. The V717-F mutation is known to alter cleavage specificity of the APP 
y-secretase such that the preference for cleavage at the AP42 site is increased. Thus, 
Asp2 acts directly or indirectly to facilitate y-secretase processing of APP at the P42 
cleavage site. 

1 5 Materials 

Antibodies 6E10 and 4G8 were purchased from Senetek (St. Louis, MO). 
Antibody 369 was obtained from the laboratory of Paul Greengard at the Rockefeller 
University. Antibody C8 was obtained from the laboratory of Dennis Selkoe at the 
Harvard Medical School and Brigham and Women's Hospital. 
20 APP Constructs used 

The APP constructs used for transfection experiments comprised the following 

APP: wild-type APP695 (SEQ ID No. 9 and No. 10) 

APP-Sw: APP695 containing the Swedish KM-NL mutation (SEQ ID No. 1 1 
and No. 12 , wherein the lysine (K) at residue 595 of APP695 is changed to 
25 asparagine (N) and the methionine (M) at residue 596 of APP695 is changed to 
leucine (L).), 

APP-VF: APP695 containing the London V-F mutation (SEQ ID Nos. 13 & 
14) (Affected residue 71 7 of the APP770 isoform corresponds with residue 642 of the 
APP695 isoform. Thus, APP-VF as set in SEQ ID NO: 14 comprises the APP695 
30 sequence, wherein the valine (V) at residue 642 is changed to phenylalanine (F).) 
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APP-KK: APP695 containing a C-terminal KK motif (SEQ ID Nos. 15 & 16), 
APP-Sw-KK: APP695-Sw containing a C-terminal KK motif (SEQ ID No. 17 

& 18), 

APP-VF-KK: APP695-VF containing a C-terminal KK motif (SEQ ED Nos. 
5 19&20). 

These were inserted into the vector pIRES-EGFP (Clontech, Palo Alto CA) 
between the Not\ and BstXl sites using appropriate linker sequences introduced by 
PCR. 



10 Transfection of antisense oligomers or plasmid DNA constructs in HEK293 cells, 
HEK125. 3 cells and Neuro-2A cells , 

Human embryonic kidney HEK293 cells and mouse Neuro-2a cells were 
transfected with expression constructs using the Lipofectamine Plus reagent from 
Gibco/BRL. Cells were seeded in 24 well tissue culture plates to a density of 70-80% 

1 5 confluence. Four wells per plate were transfected with 2 [ig DNA (3:1, 

APP:cotransfectant), 8 jil Plus reagent, and 4 \x\ Lipofectamine in OptiMEM. 
OptiMEM was added to a total volume of 1 ml, distributed 200 \i\ per well and 
incubated 3 hours. Care was taken to hold constant the ratios of the two plasmids used 
for cotransfection as well as the total amount of DNA used in the transfection. The 

20 transfection media was replaced with DMEM, 10%FBS, NaPyruvate, with 

antibiotic/antimycotic and the cells were incubated under normal conditions (37°C, 
5% C0 2 ) for 48 hours. The conditioned media were removed to polypropylene tubes 
and stored at -80°C until assayed for the content of Apl-40 and Apl-42 by EIA as 
described in the preceding examples. Transfection of antisense oligomers into 

25 HEK125.3 cells was as described in Example 7. 

Preparation of cell extracts, Western blot protocol 

Cells were harvested after being transfected with plasmid DNA for about 60 
hours. First, cells were transferred to 15-ml conical tube from the plate and 
centrifuged at 1,500 rpm for 5 minutes to remove the medium. The cell pellets were 

30 washed once with PBS. We then lysed the cells with lysis buffer (10 mM HEPES, pH 
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7.9, 150 mM NaCl, 10% glycerol, 1 mM EGTA, I mM EDTA, 0.1 mM sodium 
vanadate and 1% NP-40). The tysed cell mixtures were centrifuged at 5000 rpm and 
the supernatant was stored at -20°C as the cell extracts. Equal amounts of extracts 
from HEK125.3 cells transfected with the Asp2 antisense oligomers and controls were 
5 precipitated with antibody 369 that recognizes the C-terminus of APP and then 

CTF99 was detected in the immunoprecipitate with antibody 6E10. The experiment 
was repeated using C8, a second precipitating antibody that also recognizes the 
C-terminus of APP. For Western blot of extracts from mouse Neuro-2a cells 
cotransfected with Hu-Asp2 and APP-KK, APP-Sw-KK, APP-VF-KK or APP-VF, 
10 equal amounts of cell extracts were electrophoresed through 4-10% or 10-20% Tricine 
gradient gels (NO VEX, San Diego, CA). Full length APP and the CTF99 P-secretase 
product were detected with antibody 6E10. 
Results 

Transfection of HEK125.3 cells with Asp2-1 or Asp2-2 antisense oligomers 

15 reduces production of the CTF P-secretase product in comparison to cells similarly 
transfected with control oligomers having the reverse sequence (Asp2-1 reverse & 
Asp2-2 reverse), see Figure 9. Correspondingly, cotransfection of Hu-Asp2 into 
mouse Neuro-2a cells with the APP-KK construct increased the formation of CTF99. 
(See Fig. 10.) This was further increased if Hu-Asp2 was coexpressed with 

20 APP-Sw-KK, a mutant form of APP containing the Swedish KM-NL mutation that 
increases p-secretase processing. 

Effects of Asp2 on the production of Ab peptides from endogenously 
expressed APP isoforms were assessed in HEK293 cells transfected with a construct 
expressing Asp2 or with the empty vector after selection of transformants with the 

25 antibiotic G418. AP40 production was increased in cells transformed with the Asp2 
construct in comparison to those transformed with the empty vector DNA. Ap40 
levels in conditioned medium collected from the Asp2 transformed and control 
cultures was 424 ± 45 pg /ml and 1 13 ± 58 pg/ml, respectively (pO.001 ). Ap42 
release was below the limit of detection by the E1A, while the release of sAPPa was 

30 unaffected, 1 12 ± 8 ng/ml versus 1 1 1 ± 40 ng/ml. This further indicates that Asp2 
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acts directly or indirectly to facilitate the processing and release of AP from 
endogenously expressed APP. 

Co-transfection of Hu-Asp2 with APP has little effect on AP40 production but 
increases AP42 production above background (Table 4). Addition of the di-lysine 
5 motif to the C-terminus of APP increases AP peptide processing about two fold, 
although AP40 and AP42 production remain quite low (352 pg/ml and 21 pg/ml, 
respectively). Cotransfection of Asp2 with APP-KK further increases both AP40 and 
AP42 production. 

The APP V717-F mutation has been shown to increase y-secretase processing 
10 at the AP42 cleavage site. Cotransfection of Hu-Asp2 with the APP-VF or 

APP-VF-KK constructs increased Ap42 production (a two fold increase with APP-VF 
and a four-fold increase with APP-VF-KK, Table 4), but had mixed effects on Ap40 
production (a slight decrease with APP-VF, and a two fold increase with APP-VF-KK 
in comparison to the pcDNA cotransfection control. Thus, the effect of Asp2 on 
1 5 Ap42 production was proportionately greater leading to an increase in the ratio of 

AP42/total Ab. Indeed, the ratio of AP42/total AP reaches a very high value of 42% in 
HEK293 cells cotransfected with Hu-Asp2 and APP-VF-KK. 



20 
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Table 4 

Results of cotransfecting Hu-Asp2 or pcDNA plasmid DNA with various APP 
constructs containing the V717-F mutation that modifies y-secretase processing. 
5 Cotransfection with Asp2 consistently increases the ratio of Ap42/total A0. Values 
tabulated are AP peptide pg/ml. 
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Example 9 
Bacterial expression of human Asp2(a) 

5 Expression of recombinant Hu-Asp2(a) in E. coli. 

Hu-Asp2(a) can be expressed in E. coli after addition of N-terminal sequences 
such as a T7 tag (SEQ ID No. 21 and No. 22) or a T7 tag followed by a caspase 8 
leader sequence (SEQ ID No. 23 and No. 24). Alternatively, reduction of the GC 
content of the 5' sequence by site directed mutagenesis can be used to increase the 

10 yield of Hu-Asp2 (SEQ ID No. 25 and No. 26). In addition, Asp2(a) can be 

engineered with a proteolytic cleavage site (SEQ ID No. 27 and No. 28). To produce 
a soluble protein after expression and refolding, deletion of the transmembrane 
domain and cytoplasmic tail, or deletion of the membrane proximal region, 
transmembrane domain, and cytoplasmic tail is preferred. Any materials (vectors, 

15 host cells, etc.) and methods described herein to express Hu-Asp2(a) should in 
principle be equally effective for expression of Hu-Asp2(b). 
Methods 

PCR with primers containing appropriate linker sequences was used to 
assemble fusions of Asp2(a) coding sequence with N-terminal sequence modifications 

20 including a T7 tag (SEQ ID Nos. 21 and 22) or a T7-caspase 8 leader (SEQ ID Nos. 
23 and 24). These constructs were cloned into the expression vector pet23a(+) 
[Novagen] in which a T7 promoter directs expression of a T7 tag preceding a 
sequence of multiple cloning sites. To clone Hu-Asp2 sequences behind the T7 leader 
of pet23a+, the following oligonucleotides were used for amplification of the selected 

25 Hu-Asp2(a) sequence: #553=GTGGATCCACCCAGCACGGCATCCGGCTG (SEQ 
ID No. 35), #554=GAAAGCTTTCATGACTCATCTGTCTGTGGAATGTTG (SEQ 
ED No. 36) which placed BamHI and Hindm sites flanking the 5' and 3* ends of the 
insert, respectively. The Asp2(a) sequence was amplified from the full length Asp2(a) 
cDNA cloned into pcDNA3.1 using the Advantage-GC cDNA PCR [Clontech] 

30 following the manufacturer's supplied protocol using annealing & extension at 68°C in 
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a two-step PCR cycle for 25 cycles. The insert and vector were cut with BamHI and 
Hindlll, purified by electrophoresis through an agarose gel, then ligated using the 
Rapid DNA Ligation kit [Boerhinger Mannheim]. The ligation reaction was used to 
transform the E. coli strain JM109 (Promega) and colonies were picked for the 
5 purification of plasmid (Qiagen,Qiaprep minispin) and DNA sequence analysis . For 
inducible expression using induction with isopropyl b-D-thiogalactopyranoside 
(IPTG), the expression vector was transferred into E. coli strain BL21 (Statagene). 
Bacterial cultures were grown in LB broth in the presence of ampicillin at 100 ug/ml, 
and induced in log phase growth at an OD600 of 0.6-1.0 with 1 mM IPTG for 4 hour 

10 at 37°C. The cell pellet was harvested by centrifugation. 

To clone Hu-Asp2 sequences behind the T7 tag and caspase leader (SEQ ED 
Nos. 23 and 24), the construct created above containing the T7-Hu-Asp2 sequence 
(SEQ ID Nos. 21 and 22) was opened at the BamHI site, and then the phosphorylated 
caspase 8 leader oligonucleotides 

1 5 #559=GATCGATGACTATCTCTG ACTCTCCGCGTGAACAGGACG (SEQ ID No. 
37), #560=GATCCGTCCTGTTCACGCGGAGAGTCAGAGATAGTCATC (SEQ 
ID No. 38) were annealed and ligated to the vector DNA. The 5' overhang for each set 
of oligonucleotides was designed such that it allowed ligation into the BamHI site but 
not subsequent digestion with BamHI. The ligation reaction was transformed into 

20 JM109 as above for analysis of protein expression after transfer to E. coli strain BL21 . 

In order to reduce the GC content of the 5' terminus of asp2(a), a pair of 
antiparallel oligos were designed to change degenerate codon bases in 15 amino acid 
positions from G/C to A/T (SEQ ID Nos. 25 and 26). The new nucleotide sequence at 
the 5' end of asp2 did not change the encoded amino acid and was chosen to optimize 

25 E. Coli expression. The sequence of the sense linker is 5' 

CGGCATCCGGCTGCCCCTGCGTAGCGGTCTGGGTGGTGCTCCACTGGGTCT 
GCGTCTGCCCCGGGAGACCGACGAA G 3' (SEQ ID No. 39). The sequence of 
the antisense linker is : 5' 

CTTCGTCGGTCTCCCGGGGCAGACGCAGACCCAGTGGAGCACCACCCAGA 
30 CCGCTACGCAGGGGCAGCCGGATGCCG 3* (SEQ ID No. 40). After annealing 
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the phosphorylated linkers together in 0.1 M NaCl-10 mM Tris, pH 7.4 they were 
ligated into unique Cla I and Sma I sites in Hu-Asp2 in the vector pTAC. For 
inducible expression using induction with isopropyl b-D-thiogalactopyranoside 
(IPTG), bacterial cultures were grown in LB broth in the presence of ampicillin at 100 
5 ug/ml, and induced in log phase growth at an OD600 of 0.6-1.0 with 1 mM IPTG for 
4 hour at 37°C. The cell pellet was harvested by centrifugation. 

To create a vector in which the leader sequences can be removed by limited 
proteolysis with caspase 8 such that this liberates a Hu-Asp2 polypeptide beginning 
with the N-terminal sequence GSFV (SEQ ID Nos. 27 and 28), the following 
10 procedure was followed. Two phosphorylated oligonucleotides containing the 
caspase 8 cleavage site IETD, #571=5 f 

GATCGATGACTATCTCTGACTCTCCGCTGGACTCTGGTATCGAAACCGACG 
(SEQ ID No. 41) and #572= 

GATCCGTCGGTTTCGATACCAGAGTCCAGCGGAGAGTCAGAGATAGTCAT 

15 C (SEQ ID No. 42) were annealed and ligated into pET23a+ that had been opened 
with BamHI. After transformation into JM109, the purified vector DNA was 
recovered and orientation of the insert was confirmed by DNA sequence analysis. 

The following oligonucleotides were used for amplification of the selected 
Hu-Asp2(a) sequence: 

20 #573=5'AAGGATCCTTTGTGGAGATGGTGGACAACCTG, (SEQ ID No. 43) 

#554=GAAAGCTTTCATGACTCATCTGTCTGTGGAATGTTG (SEQ ID No. 44) 
which placed BamHI and HindHI sites flanking the 5* and 3' ends of the insert, 
respectively. The Hu-Asp2(a) sequence was amplified from the full length Hu- 
Asp2(a) cDNA cloned into pcDNA3.1 using the Advantage-GC cDNA PCR 

25 [Clontech] following the manufacturer's supplied protocol using annealing & 

extension at 68 °C in a two-step PCR cycle for 25 cycles. The insert and vector were 
cut with BamHI and Hindlll, purified by electrophoresis through an agarose gel, then 
ligated using the Rapid DNA Ligation kit [Boerhinger Mannheim]. The ligation 
reaction was used to transform the E. coli strain JM109 [Promega] and colonies were 

30 picked for the purification of plasmid (Qiagen,Qiaprep minispin) and DNA sequence 
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analysis . For inducible expression using induction with isopropyl 
b-D-thiogalactopyranoside (EPTG), the expression vector was transferred into E. coli 
strain BL21 (Statagene). Bacterial cultures were grown in LB broth in the presence of 
ampicillin at 1 00 ug/ml, and induced in log phase growth at an OD600 of 0.6-1 .0 with 
5 1 mM TPTG for 4 hour at 37°C. The cell pellet was harvested by.centrifugation. 

To assist purification, a 6-His tag can be introduced into any of the above 
constructs following the T7 leader by opening the construct at the BamHI site and 
then ligating in the annealed, phosphorylated oligonucleotides containing the six 
histidine sequence #565=GATCGCATCATCACCATCACCATG (SEQ ID No. 45), 

10 #566=GATCCATGGTGATGGTGATGATGC (SEQ ID No. 46). The 5' overhang for 
each set of oligonucleotides was designed such that it allowed ligation into the BamHI 
site but not subsequent digestion with BamHI. 
Preparation of Bacterial Pellet: 

36.34g of bacterial pellet representing 10.8L of growth was dispersed into a 

15 total volume of 200ml using a 20mm tissue homogenizer probe at 3000 to 5000 rpm 
in 2M KC1, 0.1M Tris, 0.05M EDTA, ImM DTT. The conductivity adjusted to about 
193mMhos with water. After the pellet was dispersed, an additional amount of the 
KC1 solution was added, bringing the total volume to 500 ml. This suspension was 
homogenized further for about 3 minutes at 5000 rpm using the same probe. The 

20 mixture was then passed through a Rannie high-pressure homogenizer at 1 0,000psi. 

In all cases, the pellet material was carried forward, while the soluble fraction 
was discarded. The resultant solution was centrifuged in a GSA rotor for 1 hour at 
1 2,500 rpm. The pellet was resuspended in the same solution (without the DTT) using 
the same tissue homogenizer probe at 2,000 rpm. After homogenizing for 5 minutes 

25 at 3000 rpm, the volume was adjusted to 500ml with the same solution, and spun for 1 
hour at 12,500 rpm. The pellet was then resuspended as before, but this time the final 
volume was adjusted to 1.5L with the same solution prior to homogenizing for 5 
minutes. After centrifuging at the same speed for 30 minutes, this procedure was 
repeated. The pellet was then resuspended into about 150ml of cold water, pooling 

30 the pellets from the six centrifuge tubes used in the GSA rotor. The pellet has 
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homogenized for 5 minutes at 3,000 rpm, volume adjusted to 250ml with cold water, 
then spun for 30 minutes. Weight of the resultant pellet was 17.75g. 

Summary: Lysis of bacterial pellet in KC1 solution, followed by centrifugation 
in a GSA rotor was used to initially prepare the pellet. The same solution was then 
5 used an additional three times for resuspension/homogenization. A final water 
wash/homogenization was then performed to remove excess KC1 and EDTA. 

Solublization of Recombinant Hu-Asp2(a): 

A ratio of 9-10ml/gram of pellet was utilized for solubilizing the rHuAsp2L from the 
1 0 pellet previously described. 1 7.75g of pellet was thawed, and 1 50ml of 8M guanidine 
HC1, 5mM PME, 0.1% DEA, was added. 3M Tris was used to titrate the pH to 8.6. 
The pellet was initially resuspended into the guanidine solution using a 20 mm tissue 
homogenizer probe at 1000 rpm. The mixture was then stirred at 4°C for 1 hour prior 
to centrifugation at 12,500 rpm for 1 hour in GSA rotor. The resultant supernatant 
15 was then centrifuged for 30 minutes at 40,000 x g in an SS-34 rotor. The final 
supernatant was then stored at — 20°C, except for 50 ml. 

Immobilized Nickel Affinity Chromatography of Solubilized Recombinant Hu- 

Asp2(a): 

20 

The following solutions were utilized: 

A) 6M Guanidine HC1, 0.1M NaP, pH 8.0, 0.01M Tris, 5mM PME, 0.5mM 
Imidazole 

A') 6M Urea, 20mM NaP, pH 6.80, 50mM NaCl 
25 B') 6M Urea, 20mM NaP, pH 6.20, 50mM NaCl, 12mM Imidazole 
C) 6M Urea, 20mM NaP, pH 6.80, 50mM NaCl, 300mM Imidazole 
Note: Buffers A' and C were mixed at the appropriate ratios to give intermediate 
concentrations of Imidazole. 

The 50ml of solubilized material was combined with 50ml of buffer A prior to adding 
30 to 1 00- 125ml Qiagen Ni-NTA SuperFlow (pre-equilibrated with buffer A) in a 5 x 
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10cm Bio-Rad econo column. This was shaken gently overnight at 4°C in the cold 
room. 

Chromatography Steps: 
5 Drained the resultant flow through. 

Washed with 50ml buffer A (collecting into flow through fraction) 

Washed with 250ml buffer A (wash 1) 

Washed with 250ml buffer A (wash 2) 

Washed with 250ml buffer A' 
10 Washed with 250ml buffer B' 

Washed with 250ml buffer A' 

Eluted with 250ml 75mM Imidazole 

Eluted with 250ml 150mM Imidazole (150-1) 

Eluted with 250ml 150mM Imidazole (150-2) 
15 Eluted with 250ml 300mM Imidazole (300-1) 

Eluted with 250ml 300mM Imidazole (300-2) 

Eluted with 250ml 300mM Imidazole (300-3) 

Chromatography Results: 

20 The Hu-Asp(a) eluted at 75mM Imidazole through 300mM Imidazole. The 75mM 
fraction, as well as the first 150mM Imidazole (150-1) fraction contained 
contaminating proteins as visualized on Coomassie Blue stained gels. Therefore, 
fractions 150-2 and 300-1 will be utilized for refolding experiments since they 
contained the greatest amount of protein as visualized on a Coomassie Blue stained 

25 gel. 

Refolding Experiments of Recombinant Hu-Asp2(a): 
Experiment 1 : 

Forty ml of 1 50-2 was spiked with 1M DTT, 3M Tris, pH 7.4 and DEA to a final 
30 concentration of 6mM, 50mM, and 0.1% respectively. This was diluted suddenly 
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(while stirring) with 200ml of (4°C) cold 20mM NaP, pH 6,8, 150mM NaCl. This 
dilution gave a final Urea concentration of 1M. This solution remained clear, even if 
allowed to set open to the air at room temperature (RT) or at 4°C . 
After setting open to the air for 4-5 hours at 4°C, this solution was then dialyzed 
5 overnight against 20 mM NaP, pH 7.4, 150 mM NaCl, 20% glycerol. This method 
effectively removes the urea in the solution without precipitation of the protein. 

Experiment 2: 

Some of the 150-2 eluate was concentrated 2x on an Amicon Centriprep, 10,000 
1 0 MWCO, then treated as in Experiment 1. This material also stayed in solution, with 
no visible precipitation. 

Experiment 3: 

89ml of the 150-2 eluate was spiked with 1M DTT, 3M Tris, pH 7.4 and DEA to a 
1 5 final concentration of 6mM, 50mM, and 0. 1% respectively. This was diluted 

suddenly (while stirring) with 445 ml of (4°C) cold 20 mM NaP, pH 6.8, 150 mM 

NaCl. This solution appeared clear, with no apparent precipitation. The solution was 

removed to RT and stirred for 1 0 minutes prior to adding MEA to a final 

concentration of 0.1 mM. This was stirred slowly at RT for 1 hour. Cystamine and 
20 CuS0 4 were then added to final concentrations of 1 mM and 10 \xM respectively. The 

solution was stirred slowly at RT for 10 minutes prior to being moved to the 4°C cold 

room and shaken slowly overnight, open to the air. 

The following day, the solution (still clear, with no apparent precipitation) was 

centrifuged at 100,000 x g for 1 hour. Supernatants from multiple runs were pooled, 
25 and the bulk of the stabilized protein was dialyzed against 20mM NaP, pH 7.4, 150 

mM NaCl, 20% glycerol. After dialysis, the material was stored at -20°C. 

Some (about 10 ml) of the protein solution (still in 1M Urea) was saved back 

for biochemical analyses, and frozen at -20°C for storage. 

30 



-80- 



WO 01/23533 



PCT/US00/26080 



Example 10 

Expression of Hu-Asp2 and Derivatives in Insect Cells 

Any materials (vectors, host cells, etc.) and methods that are useful to express 
Hu-Asp2(a) should in principle be equally effective for expression of Hu-Asp2(b). 
5 Expression by baculovirus infection. 

The coding sequence of Hu-Asp2(a) and Hu-ASp2(b) and several derivatives 
were engineered for expression in insect cells using the PCR. For the full-length 
sequence, a 5 '-sense oligonucleotide primer that modified the translation initiation 
site to fit the Kozak consensus sequence was paired with a 3'-antisense primer that 

10 contains the natural translation termination codon in the Hu-Asp2 sequence. PCR 

amplification of the pcDNA3.1(hygro)/Hu-Asp2(a) template was used to prepare two 
derivatives of Hu-Asp2(a) or Hu-Asp(b) that delete the C-terminal transmembrane 
domain (SEQ ID Nos. 29-30 and 50-51, respectively) or delete the transmembrane 
domain and introduce a hexa-histidine tag at the C-terminus (SEQ ID Nos. 31-32 and 

15 52-53) respectively, were also engineered using PCR. The same 5'-sense 

oligonucleotide primer described above was paired with either a 3'-antisense primer 
that (1) introduced a translation termination codon after codon 453 (SEQ ID No. 3) or 
(2) incorporated a hexa-histidine tag followed by a translation termination codon in 
the PCR using pcDNA3.1(hygro)/Hu-Asp-2(a) as the template. In all cases, the PCR 

20 reactions were performed amplified for 1 5 cycles using Pwol DNA polymerase 
(Boehringer-Mannheim) as outlined by the supplier. The reaction products were 
digested to completion with BamHl and Notl and ligated to BamHl and Notl digested 
baculovirus transfer vector pVL1393 (Invitrogen). A portion of the ligations was used 
to transform competent E. coli DH5_ cells followed by antibiotic selection on 

25 LB- Amp. Plasmid DNA was prepared by standard alkaline lysis and banding in CsCl 
to yield the baculovirus transfer vectors pVL1393/Asp2(a), pVL1393/Asp2(a)ATM 
and pVLl 393/Asp2(a)ATM(His) 6 . Creation of recombinant baculoviruses and 
infection of sf9 insect cells was performed using standard methods. 
Expression by transfection 
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Transient and stable expression of Hu-Asp2(a)ATM and 
Hu-Asp2(a)ATM(His) 6 in High 5 insect cells was performed using the insect 
expression vector pIZ/V5-His. The DNA inserts from the expression plasmids vectors 
pVL1393/Asp2(a), pVL1393/Asp2(a)ATM and pVL1393/Asp2(a)ATM(His) 6 were 
5 excised by double digestion with BamHl and Noil and subcloned into BamHl and 

. Notl digested plZ/V5-His using standard methods. The resulting expression plasmids, 
referred to as pIZ/Hu- Asp2ATM and pIZ/Hu-Asp2ATM(His) 6 , were prepared as 
described above. 

For transfection, High 5 insect cells were cultured in High Five serum free 

10 medium supplemented with 10 ^g/ml gentamycin at 27 °C in sealed flasks. 

Transfections were performed using High five cells, High five serum free media 
supplemented with 10 jig/ml gentamycin, and InsectinPlus liposomes (Invitrogen, 
Carlsbad, CA) using standard methods. 

For large scale transient transfections, 1 .2 x 10 7 high five cells were plated in a 

15 150 mm tissue culture dish and allowed to attach at room temperature for 15-30 

minutes. During the attachment time the DNA/ liposome mixture was prepared by 
mixing 6 ml of serum free media, 60 \ig Hu-Asp2(a)ATM/pTZ (+/- His) DNA and 120 
|xl of Insectin Plus and incubating at room temperature for 15 minutes. The plating 
media was removed from the dish of cells and replaced with the DNA/liposome 

20 mixture for 4 hours at room temperature with constant rocking at 2 rpm. An 

additional 6 ml of media was added to the dish prior to incubation for 4 days at 27 °C 
in a humid incubator. Four days post transfection the media was harvested, clarified 
by centrifugation at 500 x g, assayed for Hu-Asp2(a) expression by Western blotting. 
For stable expression, the cells were treated with 50 ng/ml Zeocin and the surviving 

25 pool used to prepared clonal cells by limiting dilution followed by analysis of the 
expression level as noted above. 

Purification of Hu-Asp2(a)ATM and Hu-Asp2(a)ATM(His) 6 

Removal of the transmembrane segment from Hu-Asp2(a) resulted in the 
30 secretion of the polypeptide into the culture medium. Following protein production 
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by either baculovirus infection or transfection, the conditioned medium was 
harvested, clarified by centrifugation, and dialyzed against Tris-HCl (pH 8.0). This 
material was then purified by successive chromatography by anion exchange 
(Tris-HCl, pH 8.0) followed by cation exchange chromatography (Acetate buffer at 
5 pH 4.5) using NaCl gradients. The elution profile was monitored by (1 ) Western blot 
analysis and (2) by activity assay using the peptide substrate described in Example 12. 
For the Hu-Asp2(a)ATM(His) 6 , the conditioned medium was dialyzed against Tris 
buffer (pH 8.0) and purified by sequential chromatography on IMAC resin followed 
by anion exchange chromatography. 
10 Amino-terminal sequence analysis of the purified Hu-Asp2(a)ATM(His) 6 

protein revealed that the signal peptide had been cleaved [TQHGIRLPLR, 
corresponding to SEQ ED NO: 32, residues 22-3]. 

Example 11 

IS Expression of Hu-Asp2(a) and Hu-Asp(b) in CHO cells 

The materials (vectors, host cells, etc.) and methods described herein for 
expression of Hu-Asp2(a) are intended to be equally applicable for expression of 
Hu-Asp2(b). 

Heterologous expression of Hu-Asp-2(a) in CHO-K1 cells 

20 The entire coding sequence of Hu-Asp2(a) was cloned into the mammalian 

expression vector pcDNA3.1(+)Hygro (Invitrogen, Carlsbad, CA) which contains the 
CMV immediate early promoter and bGH polyadenylation signal to drive over 
expression. The expression plasmid, pcDNA3.1(+)Hygro/Hu-Asp2(a), was prepared 
by alkaline lysis and banding in CsCl and completely sequenced on both strands to 

25 verify the integrity of the coding sequence. 

Wild-type Chinese hamster ovary cells (CHO-K1) were obtained from the 
ATCC. The cells were maintained in monolayer cultures in a-MEM containing 10% 
FCS at 37°C in 5% C0 2 . Two 100 mm dishes of CHO-K1 cells (60% confluent) were 
transfected with pcDNA3.1(+)/Hygro alone (mock) or 

30 pcDNA3. 1 (+)Hygro/Hu-Asp2(a) or pcDNA3. l(+)Hygro/Hu-Asp2(b) using the 
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cationic liposome DOTAP as recommended by the supplier (Roche, Indianapolis, IN). 
The cells were treated with the plasmid DNA/liposome mixtures for 15 hours and 
then the medium replaced with growth medium containing 500 Units/ml hygromycin 
B. In the case of pcDNA3. 1 (+)Hygro/Hu-Asp2(a) or (b) transfected CHO-K1 cells, 
5 individual hygromycin B-resistant cells were cloned by limiting dilution. Following 
clonal expansion of the individual cell lines, expression of Hu-Asp2(a) or Hu-Asp2(b) 
protein was assessed by Western blot analysis using a polyclonal rabbit antiserum 
raised against recombinant Hu-Asp2 prepared by expression in E. coli. Near 
confluent dishes of each cell line were harvested by scraping into PBS and the cells 

10 recovered by centrifugation. The cell pellets were resuspended in cold lysis buffer (25 
mM Tris-HCl (pH 8.0)/5 mM EDTA) containing protease inhibitors and the cells 
lysed by sonication. The soluble and membrane fractions were separated by 
centrifugation (105,000 x g, 60 min) and normalized amounts of protein from each 
fraction were then separated by SDS-PAGE. Following electrotransfer of the 

15 separated polypeptides to PVDF membranes, Hu-Asp-2(a) or Hu-Asp2(b) protein was 
detected using rabbit anti-Hu-Asp2 antiserum (1/1000 dilution) and the 
antibody- antigen complexes were visualized using alkaline phosphatase conjugated 
goat anti-rabbit antibodies (1/2500). A specific immunoreactive protein with an 
apparent Mr value of 65 kDa was detected in pcDNA3.1(+)Hygro/Hu-Asp2 

20 transfected cells and not mock-transfected cells. Also, the Hu-Asp2 polypeptide was 
only detected in the membrane fraction, consistent with the presence of a signal 
peptide and single transmembrane domain in the predicted sequence. Based on this 
analysis, clone #5 had the highest expression level of Hu-Asp2(a) protein and this 
production cell lines was scaled up to provide material for purification. 

25 Purification of recombinant Hu-Asp-2(a) from CHO-Kl/Hu-Asp2 clone #5 

In a typical purification, clone #5 cell pellets derived from 20 150 mm dishes 
of confluent cells, were used as the starting material. The cell pellets were 
resuspended in 50 ml cold lysis buffer as described above. The cells were lysed by 
polytron homogenization (2 x 20 sec) and the lysate centrifuged at 338,000 x g for 20 

30 minutes. The membrane pellet was then resuspended in 20 ml of cold lysis buffer 
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containing 50 mM P-octylglucoside followed by rocking at 4 °C for 1 hour. The 
detergent extract was clarified by centrifugation at 338,000 x g for 20 minutes and the 
supernatant taken for further analysis. 

The p-octylglucoside extract was applied to a Mono Q anion exchange 
5 column that was previously equilibrated with 25 mM Tris-HCl (pH 8.0)/50 mM 

P-octylglucoside. Following sample application, the column was eluted with a linear 
gradient of increasing NaCl concentration (0-1 .0 M over 30 minutes) and individual 
fractions assayed by Western blot analysis and for p-secretase activity (see below). 
Fractions containing both Hu-Asp-2(a) immunoreactivity and p-secretase activity 

10 were pooled and dialyzed against 25 mM NaOAc (pH 4.5)/50 mM p-octylglucoside. 
Following dialysis, precipitated material was removed by centrifugation and the 
soluble material chromatographed on a MonoS cation exchange column that was 
previously equilibrated in 25 mM NaOAc (pH 4.5)/ 50 mM p-octylglucoside. The 
column was eluted using a linear gradient of increasing NaCl concentration (0-1.0 M 

15 over 30 minutes) and individual fractions assayed by Western blot analysis and for 
p-secretase activity. Fractions containing both Hu-Asp2 immunoreactivity and 
P-secretase activity were combined and determined to be >95% pure by 
SDS-PAGE/Coomassie Blue staining. 

The same methods were used to express and purify Hu-Asp2(b). 

20 

Example 12 

. Assay of Hu-Asp2 P-secretase activity using peptide substrates 

(}-secretase assay 

Recombinant human Asp2(a) prepared in CHO cells and purified as described 
25 in Example 1 1 was used to assay Asp2(a) proteolytic activity directly. Activity assays 
for Asp2(a) were performed using synthetic peptide substrates containing either the 
wild-type APP p-secretase site (SEVKM 1 DAEFR), the Swedish KM-NL mutation 
(SEVNL i DAEFR), or the Ap40 and 42 y-secretase sites (RRGGV V 1 1 A 1 TVIVGER). 
Reactions were performed in 50 mM 2-[N-morpholino]ethane-sulfonate ("Na-MES," 
30 pH 5.5) containing 1% p-octylglucoside, 70 mM peptide substrate, and recombinant 
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Asp2(a) (1-5 \ig protein) for various times at 37°C. The reaction products were 
quantified by RP-HPLC using a linear gradient from 0-70 B over 30 minutes (A=0.1% 
TFA in water, B=0. l%TFA/10%water/90%AcCN). The elution profile was 
monitored by absorbance at 214 nm. In preliminary experiments, the two product 
5 peaks which eluted before the intact peptide substrate, were confirmed to have the 

sequence DAEFR and SEVNL using both Edman sequencing and MADLI-TOF mass 
spectrometry. Percent hydrolysis of the peptide substrate was calculated by 
comparing the integrated peak areas for the two product peptides and the starting 
material derived from the absorbance at 214 nm. The sequence of cleavage/hydrolysis 
10 products was confirmed using Edman sequencing and MADLI-TOF mass 
spectrometry. 

The behavior of purified Asp2(a) in the proteolysis assays was consistent with 
the prior anti-sense studies which indicated that Asp2(a) possesses P-secretase 
activity. Maximal proteolysis was seen with the Swedish P-secretase peptide, which, 
1 5 after 6 hours, was about 10- fold higher than wild type APP. 

The specificity of the protease cleavage reaction was determined by 
performing the P-secretase assay in the presence of 8 \xM pepstatin A and the presence 
of a cocktail of protease inhibitors (10 \xM leupeptin, 10 \iM E64, and 5 mM EDTA). 
Proteolytic activity was insensitive to both the pepstatin and the cocktail, which 
20 areinhibitors of cathepsin D (and other aspartyl proteases), serine proteases, cysteinyl 
proteases, and metalloproteases, respectively. 

Hu-Asp2(b) when similarly expressed in CHO cells and purified using 
identical conditions for extraction with P-octylglucoside and sequential 
chromatography over Mono Q and Mono S also cleaves the Swedish p-secretase 
25 peptide in proteolysis assays using identical assay conditions. 

Collectively, this data establishes that both forms of Asp2 (Hu-Asp2(a) and 
Hu-Asp2(b)) act directly in cell-free assays to cleave synthetic APP peptides at the p- 
secretase site, and that the rate of cleavage is greatly increased by the Swedish 
KM-NL mutation that is associated with Alzheimer's disease. 
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An alternative P-secretase assay utilizes internally quenched fluorescent 
substrates to monitor enzyme activity using fluorescence spectroscopy in a single 
sample or multiwell format. Each reaction contained 50 mM Na-MES (pH 5.5), 
peptide substrate MCA-EVKMDAEF[K-DNP] (BioSource International) (50 ^M) 
5 and purified Hu-Asp-2 enzyme. These components were equilibrated to 37 °C for 
various times and the reaction initiated by addition of substrate. Excitation was 
performed at 330 nm and the reaction kinetics were monitored by measuring the 
fluorescence emission at 390 nm. To detect compounds that modulate Hu-Asp-2 
activity, the test compounds were added during the preincubation phase of the 
10 reaction and the kinetics of the reaction monitored as described above. Activators are 
scored as compounds that increase the rate of appearance of fluorescence while 
inhibitors decrease the rate of appearance of fluorescence. 



Example 13 

15 Demonstration that Aspl processes APP at the a-secretase site 

Increased expression of an a-secretase candidate gene in human cells 
would be expected to increase basal release of sAPPa and to decrease release of Ap 
peptides. This the effect was observed when full length human Aspl is co-expressed 
with APP in HEK293 cells. The experiment utilized the APP 695 amino acid isoform 

20 which had been modified by the addition of a pair of lysine residues to the C-terminus 
(APP-KK). The C-terminal di-lysine motif increases the intracellular half-life of 
glycosylated APP and consequently the production of both sAPPa and Ap. As shown 
in Table 5, cotransfection of HEK293 cells with APP-KK with Aspl increased the 
production of sAPPa by 3.5 fold (p<0.001) and decreased the production of AP40 by 

25 2.8 fold. Thus, Aspl acts directly or indirectly to facilitate constitutive a-secretase 
cleavage and this effect is competitive with the amyloidogenic processing of APP to 
AP peptides. This implies that mutations or genetic polymorphisms in Aspl may 
affect Ap production by affecting the balance between the competing pathways for 
constitutive a-secretase cleavage and AP peptide production. 

30 

-87- 



WO 01/23533 



PCT/US00/26080 



Table 5. 

Aspl stimulates basal release of sAPPa from 
HEK293 cells after cotransfection with APP-KK. 



Transfection 


sAPPa 


Fold 


AP40 


Fold 




Hg/ml 


Increase 


pg/ml 


Decrease 


Aspl 


3.5 + 1.1 


+3.5 


113 + 7 


-2.8 


pcDNA 


1.0 + 0.2 




321 + 18 





Specific methods used were as follows. The full length Aspl cDNA was 

10 cloned into the vector pcDNA3. l/hygro+(Invitrogen) for transfection studies as 

previously described (Yan et al„ (1999) Nature 402: 533-537). The APP-KK cDNA 
was cloned into the vector pIRES (Clontech) also as previously described. HEK293 
cells were transfected with expression constructs using the Lipofectamine Plus reagent 
from Gibco/BRL. Cells were seeded in 24 well tissue culture plates to a density of 

15 70-80% confluence. Four wells per plate were transfected with 2 \ig DNA (3:1, 
APP:Aspl or empty pcDNA3. 1 ./hygro+ vector), 8jj.1 Plus reagent, and A\x\ 
Lipofectamine in OptiMEM. OptiMEM was added to a total volume of 1 ml, 
distributed 200 ^1 per well and incubated 3 hours. Care was taken to hold constant the 
ratios of the two plasmids used for cotransfection as well as the total amount of DNA 

20 used in the transfection. The transfection media was replaced with DMEM 

supplemented with 10% FBS and NaPyruvate, with antibiotic/antimycotic and the 
cells were incubated under normal conditions (37°, 5% CO ? ) for 48 hours. The 
conditioned media were removed to polypropylene tubes and stored at -80°C until 
assayed for the content of sAPPa or AP40/AP42 by enzyme-linked immunosorbent 

25 assay (EIA) as described above in Example 6. The Ap EIA followed the protocol of 
Pirttila et al (Neuro. Lett. (1999) 249: 21-4) using the 6E 10 monoclonal antibody 
(Senetek) as a capture antibody and biotinylated rabbit antiserum 162 or 165 (New 
York State Institute for Basic Research, Staten Island, NY) for detection of AP40 and 
Ap42, respectively. The 6E10 antibody recognizes residues 1-16 of the Ap peptide. 

30 The sAPPa ETA used LN27 antibody as a capture antibody and biotinylated 6E10 for 
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detection as described previously (Yan et al. y (1999) supra.). The LN27 antibody 
recognized the first 20 amino acids of the human APP peptide. 

Increased cc-secretase activity and concomitant reduction of AP production in 
vivo represents an effect that may be desirable for the prevention, treatment (e.g., to 
5 show the progression of), or cure of Azheimer's disease. Thus, the activities 

demonstrated in this example provide an indication that modulators of Aspl activity, 
that achieve the same effects in vivo, will have utility for Alzheimer's disease therapy. 
Screening methods for such modulators are contemplated as an aspect of the 
invention. 

10 

Example 14 

Expression of Pre-pro-Hu-Aspl and Derivatives in Insect Cells 

Expression ofhu-Asp-lTM(His) 6 by baculovirus infection. 

The coding sequence of pre-pro-Hu-Aspl was engineered for production as a 

15 soluble, secreted form by insect cells. PGR primers were designed to (1) delete the 
predicted transmembrane domain and cytoplasmic tail of Aspl and (2) to introduce a 
Kozak consensus sequence for efficient translatiorial initiation. The primers 
sequences were are follows: sense CGCTTTAAGCTTGCCACCATGGGCGCA 
CTGGCCCGGGCG (SEQ ED NO: 74) and antisense C GCTTTCTC G A GCT AA 

20 TGGTGATGGTGATGGTGCCACAAAATGGGCTCGCTCAAAGA (SEQ ID NO: 
75) which replaced the deleted C-terminal transmembrane and cytoplasmic domains 
with a hexahistidine purification tag. 

PCR reactions were carried out with 100 ng of full length Aspl pcDNA 3.1 
hygro+ construct, 200 M NTPs, 300 nM of each primer, Ix reaction buffer containing 

25 2 mM MgS0 4 , and 5 units of Pwo I DNA polymerase (Roche Biochemicals). The 

reactions were cycled under the following conditions: 94°C for 5 minutes followed by 
1 5 cycles of 94°C for 30 seconds and 72°C for 30 seconds, and then a final extension 
reaction at 72°C for 10 minutes. The predicted amino acid sequence of this PCR 
generated derivative (denoted as Asp- 1 ATM(His) 6 ) is set out as SEQ ID NO: 66. 
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The reaction product was digested to completion with Hindlll-Xhol and 
ligated into the expression vector pIB (Invitrogen) to yield the pIB/Asp-1 ATM(His) 6 
construct. Creation of recombinant baculovirus and infection of sf9 insect cells was 
performed using standard methods known in the art. Sf9 cells were transfected with 
5 either the pIB vector alone or the pIB/Asp- 1 ATM(His) 6 construct utilizing Insectin 
Plus reagent (Invitrogen) according to the manufacturer's instructions. After the 
transfection, the cells were cultured in High Five serum-free media (Invitrogen) for 3 
days. Subsequently, the conditioned medium was harvested and subjected to Western 
blot analysis. This analysis revealed specific expression and secretion of 

10 immunoreactive Asp- 1 ATM(His) 6 polypeptide into the extracellular medium. The 

secreted proteins were detected on the Western blot with either the India probe (Pierce 
Chemicals) specific for the hexahistidine sequence tag or using a rabbit polyclonal 
antiserum. The polyclonal antisera (denoted as UP-199) was generated by injecting 
rabbits with recombinant Asp-1 ATM(His) 6 (SEQ ID NO: 66). This recombinant 

1 5 peptide was prepared by heterologous expression in E.coli. The UP-199 antibody 
recognizes the processed form of Asp-1 ATM. 

Direct analysis with the polyclonal antiserum (UP-199) revealed an 
immunoreactive band of the expected molecular weight (50 kDa) only in pIB/Asp- 
1 ATM(His) 6 transfected cells. This signal was significantly enhanced in concentrated 

20 conditioned medium. A similar pattern was obtained using the India probe. No signal 
was detected in conditioned medium derived from mock-transfected cells using either 
UP-1 99 antisera or the India probe. 

Based on this result, transient and stable transfections of the pIB/Asp- 
1 ATM(His) 6 construct in sf9 insect cells were carried out as described above. Four 

25 days post transient transfection, the culture medium was collected to provide material 
for further characterization. In parallel, sf9 cells were stably transfected with the 
pIB/Asp-1 ATM(His) 6 construct and cultured in High Five serum-free medium 
(Invitrogen) supplemented with 50 j^g/ml blasticidin for approximately 2 weeks. 
After blasticidin selection, the resistant pool of cells was expanded to provide a stable 

30 source of conditioned medium for Asp-1 ATM(His) 6 purification. 
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Purification of recombinant Asp- 1 ATM(His) 6 

Conditioned media, from either transient or stably transfected sf9 cells, were 
5 concentrated approximately 10-fold using a stirred cell concentrator equipped with a 
30,000 MWCO membrane (Spectrum Medical Industries). This concentrate was then 
subjected to ammonium sulfate precipitation to further concentrate the sample and 
provide partial purification. Material precipitating between 0-40% saturation was 
discarded and the resulting supernatant was brought to 80% saturation. Western blot 
10 analysis of the various ammonium sulfate precipitated fractions revealed that the 
majority of the immunoreactive material was contained within the 40-80% 
ammonium sulfate pellet. As a result, this material was subjected to further 
purification. 

The 40-80% ammonium sulfate pellet was redisolved in approximately 1/20 

15 the original volume of Ni+-NTA loading buffer (25 mM Tris-HCl (pH 8.5)/0.5 M 
NaCl/10 mM imidazole). Subsequently, the sample was applied to a Ni+-NTA 
column previously equilibrated in M+-NTA buffer. Following sample application, 
the column was washed with starting buffer (25 mM Tris-HCl (ph 8.5)/ 0.5 M NaCl/ 
20 mM imidazole) until the A 280nm of the column effluent returned to zero. After 

20 washing, the bound recombinant protein was eluted off the column with a linear 

gradient of Ni+-NTA buffer containing increasing concentrations (10 mM, 50 mM, 
100 mM, 250 mM, and 500 mM) of imidazole. The elution profile was monitored by 
Western blot analysis using the UP- 199 antiserum. Immunoreactive Asp-lATM(His) 6 
was detected in the column load and eluted at 50 mM imidazole. NuPAGE gel 

25 analysis of the 50 mM imidazole fraction demonstrated a purity of Asp-1 ATM(His) 6 
of approximately 50%, therefore further purification was required. 

The positive fractions, eluted off the Ni+-NTA column, were then pooled 
(denoted as post-IMAC pool), concentrated using a YM30 membrane (Amicron), and 
dialyzed with 25 mM Tris-HCl (pH 8.0). The dialyzed post-IMAC pool was 

30 fractionated by MonoQ anion exchange chromatography (Amersham-Pharmacia 
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Biotech) gradient elution containing increasing concentrations (0 -0.5 M) of NaCl 
(Buffer A: 25 mM Tris-HCl (pH 8.0) and Buffer B: 25 mM Tris-HCl (pH 8.0)/ 0.5 M 
NaCl). The elution profile was determined by Western blot analysis which indicated 
immunoreactive fractions as those displaying immunoreactivity with the UP- 199 
5 antisera. NuPAGE gel analysis with silver staining demonstrated that the material 
prepared in this manner was >90% pure. The immunoactive fractions eluted off the 
MonoQ anion exchange column were pooled, dialyzed with 25 mM HEPES-Na+ (pH 
8.0), and stored at 4°C until further analysis. 

10 Acid-activation of recombinant Asp- 1 TM(His) 6 

Recombinant Asp-1 ATM(His) 6 migrated with an apparent molecular weight of 
50 kD. Direct N-terminal sequence analysis carried out by automated Edman 
degradation for 20 cycles revealed a unique sequence beginning at Glu 3 (SEQ ED NO: 
67), confirming the identity of the recombinant protein. Computer assisted prediction 

15 of the signal peptidase cleavage site indicated that the pro-form should initiate at Ala 1 , 
suggesting either an unusual processing site by the signal peptidase during secretion 
or an additional processing step that removes an additional two amino acid residues. 

To investigate the mechanism of pro- Asp-1 ATM(His) 6 activation, aliquots of 
the purified protein were incubated in various acidic environments with pH values 

20 ranging from 3.0-8.0 at 37°C for 2 hours. Subsequently, the recombinant proteins 

were analyzed by Western blot. A faster migrating polypeptide species was detected 
after incubation at pH values of 4.0, 4.5 and 5.0. The polypeptide migration was 
unaltered after incubation in environments which were either more acidic (pH 3.0 and 
3.5) or more basic (pH 6.0, 7.0, and 8.0). Sequence analysis of this faster migrating 

25 species revealed that it initiated exclusively at Ala 43 , consistent with removal of a 42 
amino acid residue segment of the pro-peptide that was induced by treatment of the 
pro-enzyme at pH 4.5. The predicted amino acid sequence of the acid processed form 
of Asp-1 ATM(His) 6 is set out as SEQ ID NO: 68. 

To purify the acid-activated form of Asp-1 ATM(His) 6 , the Asp-1 ATM (Hi s) 6 

30 post-lMAC pool (generated as described above) was dialyzed to pH 4.5 and then 
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subjected to affinity chromatography on either pepstatin A agarose or 
sulfolink-PHA-292593E. Following sample application, the column was washed with 
25 mM NaOAc (pH 4.5) and eluted with 50 mM Na-B0 3 (pH 9.5). The positive 
fractions eluted off the columns were dialyzed with 25 mM Hepes-Na (pH 7.5) 
5 overnight at 4°C which resulted in quantitative conversion of the pro-enzyme to the 
acid-processed form (SEQ ID NO: 68) described above. Western blot analysis of the 
elution profile revealed quantitative retention of immunoreactive Asp-1 ATM(His) 6 on 
both affinity resins as evidenced by the lack of Asp-1 ATM (His) 6 in the unbound 
fraction as detected by UP- 199 immunoreactivity on a Western blot. Step elution 50 
10 mM NaB0 3 at pH 9.5 resulted in elution of immunoreactive Asp-1 TM(His) 6 , with 
variable recovery. 

Comparison of the properties of the recombinant soluble catalytic domain of 
Aspl with the properties determined for Asp2 (see Example 10) revealed a number of 
significant differences. Processing of the pre-pro forms of either enzyme is distinct, 

15 with Aspl undergoing efficient processing by the signal peptidase and additional 
processing to remove two additional amino acid residues from the N-terminus. 
Further processing of the pro-form of Aspl was not detected in neutral pH. In 
contrast, recombinant Asp2 produced, under similar conditions, yields an eqimolar 
mixture of the pro-form and a processed form that has 24 amino acid residues of the 

20 pro-segment removed. 

Another distinction between the processing of these two enzymes involves 
processing initiated by acid-treatment. Systematic analysis of acid-induced processing 
of pro-Asp2 revealed that the purified polypeptide did not self-process. In contrast, 
acid dependent processing of pro- Aspl was readily demonstrated (as described 

25 above). Alignment of the self-processing site in Aspl with the processing site in 

Asp2 revealed that these two enzymes are processed at the same position, which is a 
different method of processing as compared with that of other known human aspartyl 
proteases. 

In addition to providing valuable information about Aspl activity, the 
30 discovery of a site of apparent autocatalytic processing of Aspl provides an indication 
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of a peptide sequences (surrounding Ala 43 ) that could be useful for performing 
screening assays to identify modulators of Aspl activity. This idea is explored in 
greater detail in Example 15. 



5 Example 15 

Development of an enzymatic assay for Asp-1 ATM(His) 6 

The relationship between Aspl and APP processing was explored by 
determining if APP cc-secretase, APP p-secretase, or APP y-secretase peptide 
substrates were cleaved by recombinant Asp- 1 ATM(His) 6 . These peptide substrates 

10 included the cc-secretase specific substrates Ap l0 _ 20 and Ap, 2 _ 2 8> * he p-secretase specific 
substrates PHA-95812E (SEVKMDAEFR; SEQ ID NO: 64) and PHA-247574E 
(SEVNLDAEFR; SEQ ID NO: 63), and y-secretase specific substrate PHA-1791 1 IE 
(RRGGVVIATVIVGER; SEQ ID NO: 76). Each reaction consisted of incubating a 
peptide substrate (100 nM) with recombinant Asp-1 ATM(His) 6 for 15 hours at pH 4.5 

15 at 37°C. Reaction products were quantified by RP-HPLC at A 214 nm . The elution 
profiles for Asp-1 ATM(His) 6 were compared to those obtained from parallel Aspl 
experiments. The identity of the cleavage products was determined by MADLI-TOF 
mass spectrometry. Table 6 summarizes the Aspl substrates and indicates the 
cleavage site. 

20 Table 6 

Substrate Preferences of Asp-1 ATM 

SEQ ID NO: 
69 
70 

25 E V N L i D A E F P-Secretase, Sw 71 

72 
73 



P4 


P3 


P2 


Pi 


PT 


pjr 


P3I 


P4' 




G 


L 


A 


L , 


A 


L 


E 


P 


Self Activation 


E 


V 


K 


M , 


D 


A 


E 


F 


P-Secretase, WT 


E 


V 


N 


L i 


D 


A 


E 


F 


P-Secretase, Sw 


L 


V 


F 


F , 


A 


E 


D 


V 


A P 12-28 (a-Secretase) 


K 


L 


V 


F , 


F 


A 


E 


D 


AP l2 . 2S (a-Secretase) 
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The peptides in Table 6 are described using the nomenclature by 
Schechter and Berger (Biochem. Biophys. Res. Commutu 27:157(1967) and Biochem. 
Biophys. Res. Commun. 32:898 (1968)), in which the amino acid residues in the 
peptide substrate that undergo the cleavage are defined as Pi ... P n toward the N- 
5 terminus and P,' . . . P n * toward the C-terminus. Therefore, the scissile bond is 
between the P, and the Pi' residue of the peptide subunits and is denoted herein 
throughout with a hyphen between the Pi and the P/. 

Digestion of the a-secreatse substrate (A 12 . 2g ) revealed two Aspl cleavage 
sites. The major product was cleaved at Phe 20 iAla 21 and the minor product was 

1 0 cleaved at Phe 19 iPhe 20 (referring to the numbering convention in the APP AP) peptide. 
Analysis of the cleavage products obtained from the P-secretase peptide substrates 
revealed that both the wild-type (PHA-95812E) and the Swedish mutation (PHA- 
247574E) substrates were hydrolyzed exclusively at the p-secretase site. Also, the 
relative rates of Asp- 1 -dependent hydrolysis of the p-secretase peptide substrate 

15 containing the Swedish mutation was cleaved at least 10-times faster than the 

corresponding wild-type peptide. Conversion of the y-secretase peptide substrate was 
not detected under these reaction conditions. 

Measurement of the cleavage of the a-secretase and P-secretase substrates can 
also be carried out with substrates comprising detectable labels such as radioactive, 

20 enzymati , chemiluminescent or flourescent labels. For example, the peptide 
substrates could comprise internally quenched labels that result in increased 
detectability after cleavage of the peptide substrates due to separation of the labels 
upon cleavage. The peptide substrates can be modified to have attached a paired 
fluorprobe and quencher such as 7-amino-4-methyl courarin and dinitrophenol, 

25 respectively. 

This example illustrates the a-secretase and p-secretase activity exhibited by 
Asp-1, confirming the APP processing activity of Aspl indicating, e.g., in Examples 7 
and 13. The substrates described herein may be used in combination with recombinant 
Asp 1 to measure Asp 1 proteolytic activity at the a-secretase and p-secretase 
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processing sites. These substrates are useful in screening assays for identification of 
modulators of Aspl proteolytic activity. 

In particular, production of AP species through the processing of APP at p-and 
y-secretase sites may play a central role in Alzheimer's disease pathogenesis, and 
processing at the a-secretase site may have a protective role and may prevent Ap 
production. Thus, a therapeutic and/or prophylactic indication exists for molecules 
that can increase Aspl a-secretase activity and/or decrease Aspl P-secretase activity 
in vivo. The present invention includes screening assays for such modulators, and the 
foregoing substrate peptides are useful in such assays. 

It will be clear that the invention may be practiced otherwise than as 
particularly described in the foregoing description and examples. 

Numerous modifications and variations of the present invention are possible in 
light of the above teachings and, therefore, are within the scope of the invention. The 
entire disclosure of all publications cited herein are hereby incorporated by reference. 
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What is claimed is: 

1 . A method for assaying hu- Asp 1 a-secretase activity comprising the 
steps of: 

(a) contacting hu-Aspl protein with an amyloid precursor protein (APP) 
substrate, wherein said substrate contains an a-secretase cleavage site; and 

(b) measuring cleavage of the APP substrate at the a-cleavage site, thereby 
assaying hu-Aspl a-secretase activity. 

2. A method according to claim 1 , wherein the hu-Aspl protein 
comprises a polypeptide produced in cell transformed or transfected with a 
polynucleotide comprising a nucleotide sequence that encodes hu-Aspl or a fragment 
thereof that retains Aspl a-secretase activity. 

3. A method of claim 2, wherein the hu-Aspl protein is purified and 
isolated from said cell. 

4. A method according to claims 2 or 3, wherein the nucleotide sequence 
encodes a polypeptide that comprises the hu-Aspl amino acid sequence set forth, in 
SEQ ED NO: 2 or a fragment thereof, wherein said fragment retains a-secretase 
activity. 

5. A method according to any one of claims 2-4, wherein the 
polynucleotide sequence encodes a hu-Aspl amino acid sequence lacking the 
transmembrane amino acids 469-492 of SEQ ID NO: 2. 

6 A method according to claim 5, wherein the polynucleotide sequence 
encodes a hu-Aspl amino acid sequence further lacking the cytoplasmic domain 
amino acids 493-518 of SEQ ID NO: 2. 
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7. A method according to any one of claims 1-6, wherein the hu-Aspl 
amino acid sequence lacks amino terminal amino acids 1-62 of SEQ ID NO: 2. 

8. A method according to claims 1 or 2, wherein the contacting step 

5 comprises growing a cell transfected or transformed with a polynucleotide encoding 
hu-Aspl protein or a fragment thereof that retains hu-Aspl cc-secretase activity, 
wherein the cell is grown under conditions in which the cell expresses the hu-Aspl 
protein in the presence of the APP substrate. 

10 9. A method of claim 8, wherein said cell expresses a polynucleotide 

encoding an APP substrate containing an cc-secretase cleavage site, and wherein the 
contacting step comprises growing the cell under conditions in which the cell 
expresses the hu-Aspl protein and the APP substrate. 

15 10. A method of any one of claims 1-9, wherein the APP substrate cc- 

secretase cleavage site comprises the amino acid sequence LVFFAEDF or 
KLVFFAED. 

11. A method of any one of claims 1-10, wherein the APP substrate 

20 comprises a human APP isoform and further comprises a carboxy-terminal di-lysine. 

1 2. A method of claims 10 or 1 1 , wherein the APP substrate comprises a 
detectable label. 

25 13. A method of claim 12, wherein the detectable label is selected from the 

group consisting of radioactive labels, chemiluminescent labels, enzymatic labels, 
chemiluminescent labels and flourescent labels. 
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14. A method of anyone of claims 1-13, wherein the APP substrate 
comprises a human APP isoform and the determining step comprises measuring the 
production of amyloid alpha peptide (sAPPa). 

5 15. A method of any one of claims 1-14, wherein the method further 

comprises the steps of: 

(a) determining the level of hu-Aspl a-secretase activity in the presence 
and absence of a modulator of hu-Aspl a-secretase activity; and 

(b) comparing the hu-Aspl a-secretase activity in the presence and 
10 absence of the modulator, wherein modulators that increase hu-Aspl a-secretase 

activity are identified as candidate Alzheimer's disease therapeutics. 

16. A method of claim 15, wherein the method further comprises a step of 
treating Alzheimer's disease with said candidate Alzheimer's disease therapeutic. 

15 

17. A composition comprising a candidate Alzheimer's disease therapeutic 
identified by the method of claim 15. 

18. A hu-Aspl protease substrate peptide or fragment thereof, wherein said 
20 peptide comprises an amino acid sequence consisting of fifty or fewer amino acids, 

said amino acid sequence including the hu-Aspl cleavage site having the amino acid 
sequence GLALALEP 

1 9. The substrate of claim 1 8, wherein the substrate comprises a detectable 

25 label. 

20. The substrate of claim 19, wherein the detectable label is selected from 
the group consisting of radioactive labels, enzymatic labels, chemiluminescent labels 
and flourescent labels. 

30 
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21 . A method for assaying hu-Aspl proteolytic activity comprising the 
steps of: 

(a) contacting hu-Aspl protein with an hu-Aspl substrate according to 
claim 18, 19, or 20 under acidic conditions, and 
5 (b) determining the level of hu-Aspl proteolytic activity. 

22. A method according to claim 20, wherein the hu-Aspl protein 
comprises a polypeptide produced in cell transformed or transfected with a 
polynucleotide comprising the nucleotide sequence that encodes hu-Aspl . 

10 

23. A method of claim 21 , wherein the hu-Aspl protein is purified and 
isolated from said cell. 

24. A method according to claims 22 and 23, wherein the nucleotide 

1 5 sequence encodes a polypeptide that comprises the hu-Aspl amino acid sequence set 
forth in SEQ ID NO: 2 or a fragment thereof, wherein said fragment retains 
proteolytic activity. 

25. A purified polynucleotide comprising a nucleotide sequence encoding 
20 a polypeptide that comprises a fragment of a hu-Asp-1 protein, wherein said 

nucleotide sequence lacks the sequence that encodes the transmembrane domain of 
said hu-Aspl protein, and wherein the hu-Aspl polypeptide fragment encoded by said 
polynucleotide has hu-Aspl cc-secretase activity. 

25 26. A polynucleotide of claim 25, wherein the polypeptide comprises a 

fragment of the hu-Aspl amino acid sequence set forth in SEQ ID NO: 2, and wherein 
the polypeptide lacks the transmembrane domain amino acids 469-492 of SEQ ID 
NO: 2. 
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27. A polynucleotide of claim 26 wherein the polypeptide further lacks 
cytoplasmic domain amino acids 493-518 of SEQ ID NO: 2. 

28. A purified polynucleotide comprising a nucleotide sequecne encoding 
5 a polypeptide that comprises a fragment of a hu-Aspl protein, wherein said nucleotide 

sequence lacks the sequence that encodes amino acids 1-62 of SEQ ID NO: 2, and 
wherein the polypeptide has hu-Aspl a-secretase activity. 

29. A polypeptide of claim 28, wherein said polypeptide lacks amino 
10 terminal amino acids 23-62 of SEQ ID NO: 2. 

30. A purified polynucleotide comprising a nucleotide sequence that 
hybridizes under stringent conditions to the non-coding strand complementary to SEQ 
ID NO: 1, wherein the nucleotide sequence encodes a polypeptide having Aspl 

1 5 proteolytic activity and wherein the polynucleotide lacks nucleotides encoding a 
transmembrane domain. 

31. A purified polynucleotide comprising a nucleotide sequence that 
hybridizes under stringent conditions to the nucleic acid according to claim 30, 

20 wherein the nucleotide sequence encodes a polypeptide further lacking a pro-peptide 
domain corresponding to amino acids 23-62 of SEQ ID NO: 2. 

32. A vector comprising the polynucleotide of any one of claims 25-3 1 . 

25 33. A host cell transformed or transfected with a vector of claim 32. 

34 A host cell transformed or transfected with a polynucleotide of any one 
of claims 25-31. 
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35. A purified polypeptide comprising a fragment of a hu-Aspl protein, 
wherein said polypeptide lacks the hu-Aspl transmembrane domain of said hu-Aspl 
protein and retains hu-Aspl a-secretase activity. 

5 36. A polypeptide of claim 35, wherein said polypeptide comprises a 

fragment of hu-Aspl having the amino acid sequence set forth in SEQ ID NO: 2 and 
wherein said polypeptide lacks the transmembrane domain amino acids 469-492 of 
SEQ ID NO: 2. 

10 37. A polypeptide according to claim 35 or 36, wherein said polypeptide 

further lacks cytoplasmic domain amino acids 493-518 of SEQ ID NO: 2. 

38. A polypeptide according to any one of claims 35-37, which further 
lacks amino terminal amino acids 1-62 of SEQ ID NO: 2. 

15 

39. A polypeptide comprising a fragment of hu-Aspl having the amino 
acid sequence set forth in SEQ ID NO: 2 and wherein said polypeptide lacks the 
amino terminal amino acids 1-62 of SEQ ID NO: 2 and retains APP processing 
activity. 

20 

40. A polypeptide comprising an amino acid sequence at least 95% 
identical to a fragment of hu-Aspl protein, wherein said polypeptide and said 
fragment lack a transmemebrane domain and retain hu-Aspl a-secretase activity. 

25 41 . A polypeptide comprising an amino acid sequence at least 95% 

identical to a fragment of hu-Aspl protein, wherein said polypeptide and said 
fragment lack the amino terminal amino acids corresponding to the pro-peptide 
domain of hu-Aspl and retain APP processing activity. 

30 



-102- 



WO 01/23533 



PCT/USOO/26080 



FIGURE 1 (1) 

ATGGGCGCACTGGCCCGGGCGCTGCIXX 
-MGALARALLLPLLAQWLLRA 

CCCCGGAGCTIXXK:CCC03CGCCCTTCACGCTGCCCCTCCGGGT^ 

APELAPAPFTLPLRVA A A T N 
CGCGTAGTTGCGCCC^CCCCGGGACCOXSGACCCra 

RVVAPTPGPGTPAKRHADGL 
GCGCTCGCCCrGGAGCCTGCCCI^CGTCCCCC^ 

ALALE P A LA S P AGAAN F LAM 

GTAGACAACCTGCAGGGGGACTCTGG CCGCGGCTACTACCTGGAGATG CTGATCGGGACC 
VDNLQGD SGRGYYLEMLI GT 

CCCCCGCAGAAGCTACRGATTCTCGTTGACACTGGAAGCAGTA 

P PQKLQI LVDTGSSNFAVAG 

ACC CCGCACTC CTACATAGACACGTACl'l'l'GACACAGAGAGGTCTAGCACATAC CGCTCC 
TPHSY IDTYFDTERSSTYRS 

AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTT 
KGFDVTVKYTQGSWTGF VGE 

GACCTCGTCAC CATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCA 
DLVTI PKGFNTSFLVNIATI 

TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGG 

FE S ENFF LP G I KWNG I LGLA 
TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCT^ 

YATL.AKP S S SLETFFDSLVT 
CAAGCAAACATCCCCAACGTTTTCTCC^TGC^ 

QANI PNVFSMQMCGAGLPVA 

GGATCTGGGACCAACGGAGGTAGTCTTGTCTTXKK5TGGAATTGAA 
GSGTNGGSLVLGGIEPSLYK 

GGAGACATCTGGTATACC C CTAT^AAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 
GDIWYTPIKEEWYYQIEILK 

TTGGAAATTGGAGGCCAAAG C CTTAATCTGGACTG CAGAGAGTATAACGCAGAGAAGGC C 
LE IGGQS LNLDCREYNADKA 

. ATCGTGGACAGTCGCACCACGCTGCTGCGCCTG CCCCA 
IVDSGTTLLRLPQKVFDAVV 

GAAGCTGTGGCCCGCGCATCTCTGATTCCAGA^ 

EAVARASLI PEFSD GFWTGS 

CAG CTGGCGTGCTGGACGAATTCGGAAACAC CTTGGTCTTACTTCCCTAAAATCTCCATC 
QLACWTNSETPWSYFPKISI 

TACCTGAGAGATGAGAACTCCAGCAGGTC^TTCCG 

YLRDENSSRSFRITILPQLY 

ATTCAGCC CATGATGGGGG C CGG CCTGAATTATGAATGTTACCGATTCGGCATTTCC CCA 
IQPMMGAGLNYECYRFGISP 
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TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCT 
STNALVI GATVMKGFYVI FD 

AGAGCCCAGAAGAGGGTGXKSCTTCGCAGCGAGCCCCTCTGCA^ 
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FIGURE 1 (2) 



RAQKRVG FAAS P C A E I A G A A 
GTGTCTGAAATTTCCGGGCCrrrCTCAACAGAG 

VSEISGPFSTEDVASNCVPA 

CAGTCl'TrGAG CGAGC CCA'1'XTTGTGGAITGTGTC CTATGCGCTCATGAG CGTCTGTGGA 
QS LSEPILWIVSYALMSVCG 

GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGT^ 

AI LLVLIV LLLLPFRCQRRP 

CGTGACC CTGAGGTC GTCAATGATGAGT CCTCTCTGGTCAGACATCG CTGGAAATGAATA 
RD PEVVNDE S SLVRHRWK 

GCCAGGCCTGACCTGAAGCAACCATGAACTCAGCTA^ 

AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCT^ 

G CTC CCAGATG CCTTCTAGATTCACTGTCTTTTGATTCT^ 

CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACT^ 

AAAA 
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FIGURE 2 (l) 



ATGGCCCAAGCCCTGCCCTGGCTCCTC 

MAQALPWLLLWMGAGVLPAH 
GGCACCGAGGACGGCATCCGGCTGCCCCTC 

GTQHGIRLPLRSGLGGAPLG 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 
LR LPRETDEEPEEPGRRGSF 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 
VEMVDNLRGKSGQGYYVEMT 

GTGGGCAGCCCCCCGCAGACGCTCT^CATCCTGGTGGATAC^ 

VGSPPQTLNILVDTGSSNFA 
GTGGGTGCTGCCCCCCACCCCTTCCTGCATC^ 

VGAAPHPFLHRYYQRQLSST 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 
YRDLRKGVYVPYTQGKWEGE 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 
LGTDLVS I PHG PNVTVRANI 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGck?AAGG 
AAITESDKFFINGSNWEGIL 

GGGCTGGCCTATGCTGAGATTGCCAGGCTTTGTGGTGCTGGCTTCC 
GLAYAEIARLCGAGFPLNQ S 

GAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATCGACQACT 
EVLASVGGSMI IGGIDHSL. Y 

ACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTATGAGGTGATCATTGTG 
TGS L WYTPIRREWYYEVI IV 

CGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAAGGAGTAGAACTATGACAAG 
RVE INGQDLKMDCKEYNYDK 

AGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAGAAAGTGTTTGAA 
SIVDSGTTNLRLPKKVFEAA 

GTCAAATCCATCAAGG CAGC CTC CTC CACGG AGAAGTTCCCTGATGGTTTCTGGCTAGGA 
VKS I KAASSTEKFPDGFWLG 

GAGGAGCTGGTGTGCTGGCAAGCAGGCACGACCCCTTGGAA 

EQLVCWQAGTTPWNIFPVIS 
CTCTACCTAATGGGTGAGGTTACGAACCAGTCCTT 

LYLMGEVTNQSFRITILPQQ 
TACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGTTACAAGTTTO 
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FIGURE 2 (2) 



YLRPVEDVATSQDDCYKFAI 
TCACAGTGATCCACGGGCACTGTTATGGGAGC 

SQSSTGTVMGAVIMEGFYVV 

TTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGCCATC 
FDRARKRIGFAVSACHVHDE 

TTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATGGAAG^ 
FRTAAVEGPFVTLDMEDCGY 

AACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCCTATGTCATGGCTC 
NIPQTDESTLMTIAYVMAAI 

TGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGGCGCTGCC 
CALFMLPLCLMVCQWRCLRC 

CTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTGAAGTC 
LRQQHDDFADDISLLK 

TGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGGTTGACTTTGGTC^ 
GGAGACACAGATGGOACCTGTGGCCAGAGCACCT 

CTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAGGGACTGTACCTGTAG 

GAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAATACTCTTGGTCACCTCAAA 

TTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCTTTGTCCACCATTCCT 

TTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTA 

GCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGA 

CCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTAT^ 

GACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTTGAA7VAAAAAA 
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FIGURE 3 (1) 



ATGGCCCAAGCCCTGCCCTGGC'TCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 
MAQAL PWLLLWMGAGVLPAH 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 
GT QHG IRLPLRSGL GGAPLG 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 
LRLPRETDEEPEEPGRRGSF 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATC 
VEMVDNLRGKSGQGYYVEMT 

GTGGGCAGCCCCCCGGAGACGCTCAAGATCCTC 

VGSPPQTLNILVDTGSSNFA 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 
VGAAPHPFLHRYY QRQLSST 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 
YRDLRKGVYVPYTQGKWEGE 

CTGGGGACCGACCTGGTAAGCATCCCCGATC 

LGTDLVS IPHGPNVTVRANI 
GCTGCCATCACTGAATGAGACAAGTTCTTGA^ 

AAITESDKFFINGSNWEGIL 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCITTCTTTGACT 
GLAYAEIARPDDS LE PFFDS 

CTGGTAAAGGAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGCTTC 
LVKQ'THVPNLFSLQLCGAGF 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 
PLNQSEVLASVGGSMIIGGI 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 
DHSLYTGSLWYTPIRREWYY 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAAGGAG 
EVI IVRVEINGQDLKMDCKE 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCAC 

YNYDKS IVDSGTTNLRLPKK 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTC 
VFEAAVKSIKAASSTEKFPD 
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FIGURE 3 (2) 



GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCA 

GFWLGEQLVCWQAGTTPWNI 
TTCCCAGTGATCTCACTCTACCTAATGGGTGAGG 

FPVISLYL MGBVTNQSFRIT 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTC 
ILPQQYLRPVEDVATSQDDC 

TACAAGTTTGCCATCTCACA.GTCATCCACGGGCA 

Y K FA I S Q S .S TGTVMGAV I ME 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTC 
GFYVVFDRARKRIGFAVSAC 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTC 
HVHDEFRTAAVEGPFVTLDM 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCT 

EDCGYNIPQTDESTLMTIAY. 
GTCATGGCTGCCATCTGCGCCCTCTTGA^ 

VMAAI CALFMLPLCLMVCQW 
CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGAC 

RCLRCLRQQHDDFADDISLL 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGGTTCA 
K 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGA 

C C AC CCAC C AAATGC CTCTGC CTTGATGGAGAAGG AAAAGGCTGGCAAGGTGGGTTCCAG 
GGACTGTACCTGTAGGAAACAGAAAAGAGAAGTUU^GAAGCACTCTGCTGGCGGGAATACT 
CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAA ACTT CAGCC CTGA ACCT 
TTGTCCACCATTCCTTTAAATTCTCCAACCGAAAGTATTCTTCT^ 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 
AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 
TG C TTTAG AG ACAGGG ACTG TAT AAAC AAG C CTAACATTGGTG C AAAG ATTG C C TCTTGA 
ATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 4 



ATGGCCCCAGCGCTGCACTGGCTCCTGCT 

MAPALHWLLLWVGSGMLPAQ 
GGAACCCATCTCGGC^TCCGGCTGCCCCTTOSC^ 

GTHLGI RLPLRSGLAGPP LG 
CTGAGGCTGCCCCGGGAGACTGACGAGGAATCGGAGGAGCCTGG 

LRLPRETDEESEEPGRRGSF 
GTGGAGATGGTGGACAACCTGAGGGGAAAGTCCGGCCAGGGCTACTATGTGGAGATGACC 
VE MVDNLRGKSGQGYYVEMT 
GTAGGCAGCCCCCGACAGACGCTCAACATCCTGGTGGACACGGGCAGT^ 

vgsppqtlnilvdtgssnfa 

gtgggggctgcccca(^cccittcctgcatcgcrractac 

vgaaphpflhryyqrqls st 

tatcgagacctccgaaagggtgtgtatgtgccctacacccagggcaagtgggag 

yrdlrkgvyvpytqgkwege 

ctgggcaccgacctggtgagcatccctcatggccccaacgtcactgtc 

lgtdlvs i phgpnvtvrani 

gctgccatcactgaatcggacaagttcttcatcaatgg 

aaitesdkffingsnwegil 

gggctggccratgcrgagattgccaggcccgacgaotctto 

glayae iarpddslepffds 

ctggtgaagcagacccacattcccaacatcttttccctgcagcr 

lvkqthi pni fslqlcgagf 

cccctcaaccagaccgaggcactggcctcggtgggag 

plnqtealasvggsmi iggi 

gaccactcgctatacacgggcagtctctggtagacacccatccg 

dhslytgslwytpirrewyy 

gaagtgatcattgtacgtgtggaaatcaatggtcaagat^ 

evi ivrve ingqdlkmdc ke 

tacaactacgacaagagcattgtggacagtgggaccaccaaccttcgcttg 

ynydks ivdsgttnlrlp kk 

gtatttgaagctgccgtcaagtccatcaaggcagcctcctc^ 

v f e a a v k s ikaasstekfpd 

ggcttttggctaggggagcagctggtgtgctggcaagc^ 

g f w l g eqlvcwqagttpwni 

ttcccagtcatitcactttacctcatgggtgaagtra^ 

fpvislylmgevtnqs frit 

atccttcctcagcaatacctacggccggtggaggacgtggccacgtcccaagacg^ 

ilpqqylrpvedvatsqddc 

tacaagttcgctgtctc^cagtcatccacgggcactgttatcggagccgt 

ykfavsqs stgtvmgavi me 

ggtttctatgtcgtcttcgatcgagcccgaaagcgaat^ 

gfyvvfdrarkrigfavsac 

catgtgcacgatgagttcaggacgg cgg cagtggaaggt c cgtttgttacggcagacatg 

hvhde frtaavegpfvtadm 

gaagactgtggctaczaacattc c c cagacagatglmtcaacacttatgaccatagcctat 

edcgyni pqtdestlmtiay 

gtcatggcggccatctgcgccctcitrcatgt 

vma ai calfmlplclmvcqw 
cgctgcctocgttgcctgcgccaccagcacgatgactttgctk^ 
rclrclrhqhddfadd i s. l l 
aagtaaggaggctcgtgggcagatgatggagacgcccctggaccacat 

K 

CITTGGTCACATGAGTTGGAGCTATGGATGGTACCTGTGGCCA 
CACCAACCTGCC^TGCTTCTIt5GCGTX3ACAGAAC7^^ 

GGGCTTGCAC CTGTAGGACACAGGAGAGGGAAGGAAGCAGCGTTCTGGTGGCAGGAATAT 
CCTTAGGCACCAC^UUUrrTCAGTTGGAAATT^ 

CTGCCCAG CATC CXTTAGAGTCTC CAAC CTAAAGTATTCTTTATGTCCTTCCAGAAGTAC 
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TGGCGTCATACTCAGGCTACCCGGCATGTG 

AATCTCATTCCCTGCTGGCCAAAGTCAGCAGAAGAAGGTGAAGT^ 

TGATAGGGACTCCAGACTCAAGC 

GAA 
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FIGURE 5 

1 MAQALPWLLLWMGAGVLPAHGTQHGIRLPLRSGLGGAPLGLRLPRETDEE 50 

II II Mill. I- MM II 1 1 1 1 1 1 1 1 1 1 I 1 1 1 I 1 1 1 1 I !! 1 1 

1 MAPALHWLLLWVGSGMLPAQGTHLGIRLPLRSGLAGPPLGLRLPRETDEE 50 
51 PEEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFA 100 

MIMMMMMMMIIIIMIMIMMMIMMIIMMIIMI 

51 SEEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFA 100 

• • * • • 

101 VGAAPHP FLHR YYQRQLS STYRDLRKGVYVP YTQGKWEGELGTDLVS I PH 150 

I I I 1 1 I 1 1 I 1 1 1 1 1 I 1 1 1 II M 1 1 1 MM 1 1 M i 1 1 1 M M I I 1 1 1 1 II I 

101 VGAAPHP FLHRYYQRQLS S TYRDLRKGVYVP YTQGKWEGE LGTDL VS I PH 150 

• « • • ■ 

151 GPNVTVRANIAAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDS 200 

1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 

151 GPNVTVRANIAAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDS 200 

201 LVKQTHVPNLFS LQLCGAGFPLNQS EVLAS VGGSMI IGGIDHS LYTGSLW 2 50 

I I I I I I : I I : M I I I 1 I I I I I I I M MMMMMMMMMIMM 
2 01 LVKQTHIPNIFSLQLCGAGFPLNQTEALASVGGSMIIGGIDHSLYTGSLW 250 

251 YTPIRREWYYEVI IVRVE INGQDLKMDCKE YNYDKS IVDSGTTNLRLPKK 3 00 

M M II 1 1 M 1 1 M M I I M 1 1 1 M II M 1 1 M M M M I I I II I M II I 

251 YTPIRREWYYEVI IVRVEINGQDLKMDCKEYNYDKS IVDSGTTNLRLPKK 300 
301 VFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMG 350 

1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

301 VFEAAVKS I KAAS STEKFPDGFWLGEQLVCWQAGTTPWNI FPVIS LYLMG 350 

351 EVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIME 4 00 

I M M I I I I I I I II I II I I! II I I I I II I I II I h II I I I I I II I I M I I 
351 EVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIME 400 

401 GFYVVFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQT 450 

1 1 1 1 I 1 1 1 I I ! 1 1 1 1 1 I i 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 I f MIMMIMM 

401 GFYVVFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTADMEDCGYNIPQT 450 

451 DESTL^IAYVMAAICALFMLPLCLMVCQWRCLRCLRQQHDDFADDISLL 500 

MMMMMMMMMMMMMMIIMIMM M I I I I I I I I 1 I 
451 DESTLMTIAYVMAAICALFMLPLCLMVCQWRCLRCLRHQHDDFADDISLL 500 

501 K 501 
I 

501 K 501 
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FIGURE 6 (1) 



ATGGCTAGCATGACTGGTGGACAGCAAATGGGTCGCGGATCCACCCAGCACGGCATCCGG 
MASMTGGQQMGRGS T Q H G I R 

CTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACC 
LPLRSGLGGAPLGLRLPRET 

GACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTTGTGGAGATGGTGGACAACCTG 
DEE PEEPGRRGS FVEMVDNL 

AGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACCGTGGGCAGCCCCCCGCAGACG 
RGKSGQGYYVEMTVGSPPQT 

CTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCC 
LNI LVDTGSSNFAVGAAPH P 

TTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACATACCGGGACCTCCGGAAGGGC 
FLHRYYQRQLSSTYRDLRKG 

GTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGC 
VYVPYTQGKWEGELGT DLVS 

ATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATTGCTGCCATCACTGAATCAGAC 
I PHGPNVTVRAN IAAI TES D 

AAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTGGGGCTGGCCTATGCTGAGATT 
KFFINGSNWEGI LGLAYAEI 

GCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTT 
ARPDDS LEPFFDSLVKQTHV 

CCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAGTG 
P. NLFS LQLCGA GFPLNQS EV 

CTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATCGACCACTCGCTGTACACAGGC 
L ASVGGSMI IGGI DHSLYTG 

AGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTATGAGGTCATCATTGTGCGGGTG 
SLWYTP I RREWYYEVI IVRV 

GAGATCAATGGACAGGAT CT GAAAAT GGACT GCAAGGAGTACAACTAT GACAAGAGCATT 
EINGQDLKMDCKEYNYDKS I 

GTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAGAAAGTGTTTGAAGCTGCAGTCAAA 
VDSGTTNLRLPKKVFEAAVK 

TCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAG 
SI KAAS STEKFPDGFWLGEQ 

CTGGTGTGCTGGC7VAGCAGGCACCACCCCTTGGAACATTTTCCCAGTCATCTCACTCTAC 
LVCWQAGTTPWNI FPVISLY 

CTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACCATCCTTCCGCAGCAATACCTG 
LMGEVTNQS F R I TI LP'QQYL 

CGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGTTACAAGTTTGCCATCTCACAG 
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FIGURE 6 (2) 



RPVEDVAT SQDDCYKFAI SQ 

TCATCCACGGGCACTGTTATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGAT 
S S TGTVMGAVI MEGFYVV FD 

CGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGG 
R A R K R I GFAVSACHVHDE FR 

ACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATGGAAGACTGTGGCTACAACATT 
TAAVEGPFVTLDMEDCG YN I 

C CACAGACAGAT GAGT CAT GA 
P Q T D E S * 
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FIGURE 7 (1) 

ATGGCTAGCATGACTGGTGGACAGCAAATGGGTCGCGGATCGATGACTATCTCTGACTCT 
MASMTGGQQMGRGSMTISDS 

CCGCGTGAACAGGACGGATCCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTG 
P R E Q D G S TQHGIRLPLRSGL 

GGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCC 
GGAPLGLRL. PRETDEEPEEP 

GGCCGGAGGGGCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGC 
GRRGS FVEMVDNLRGKSGQG 

TACTACGTGGAGATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACA 
YYVEMTVGSPPQTLN I LVDT 

GGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAG 
GSSNFAVGAAPHPFLHRYYQ 

AGGCAGCTGTCCAGCACATACCGGGACCTCCGGAAGGGCGTGTATGTGCCCTACACCCAG 
RQLSSTYRDLRKGVYVPYTQ 

GGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTC 
GKWEGELGTDLVS I PHG PNV 

ACTGTGCGTGCCAACATTGCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCC 
TVRANIAAITESDKFFINGS 

AACTGGGAAGGCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTG 
NWEGI LGLAYAEIARPDDSL 

GJ^GCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAG 
E PFFDS LVKQTHVPNLFSLQ 

CTTTGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGC 
LCGAGFPLNQSEVLASVGGS 

AT GAT CATT G GAG GTAT C GAC CACT C GCT GTACACAGGCAGT CT CTGGT ATACAC C CAT C 
MI IGGIDHSLYTGSLWYTPI 

CGGCGGGAGTGGTATTAT GAG GT CAT CATT GTGCGGGTGGAGATCAATGGACAGGATCTG 
RREWYYEVI IVRVEINGQDL 

AAAATGGACTGCAAGGAGTACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAAC 
K MDCKEYNYDKS IVDS GTTN 

CTTCGTTTGCCCAAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCC 
LRLPKKVFEAAVKSI KAASS 

ACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGC 
TEKFP DGFWLGEQLVCWQAG 

ACCACCCCTTGGAACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAAC 
TTPWNI FPVI SLYLMGEVTN 
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FIGURE 7 (2) 



CAGTCCTTCCGCATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCC 
QSFRITILPQQYLrRPVEDVA 

ACGTCCCAAGACGACTGTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATG 
TSQDDCYKFAI SQS STGTVM 

GGAGCTGTtATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGC 
GAVI M'EGFYVVFD RARKRI G 

TTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCT 
FAVSACHVHDEFRTAAVEGP 

TTTGTCACCTTGGACATGGAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCATGA 
FVT LDMEDCGYN I PQTDES + 



14/20 



WO 01/23533 



PCT/US00/26080 



FIGURE 8 (1) 



ATGACTCAGCATGGTATTCGTCTGCCACTGCGTAGCGGTCTGGGTGGTGCTCCACTGGGT 
MTQHGI RLPLRSGLGGAP LG 

CTGCGTCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 
LRLPRETDEEPEEPGRRGSF 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC. 
V EMVDNLRGKS GQGYYVEMT 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 
VGS PPQTLNI LVDTGS S N F A 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 
VGAAPH PFLHRYYQRQL S ST 

TACCGGGACCTCCGGAAGGGCGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 
YRDLRKGVYVPYTQGKWEGE 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 
LGTDLVS I PHGPNVTVRAN I 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 
AAITESDKFFINGSNWEGIL 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 
GLAYAEIARPDDSLEPFFDS 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGCTTC 
LVKQTHVPN LFSLQLCGAGF 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 
PLN QS EVLASVGGSMI I GGI 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 
DHSLYTGSLWYTPI RREWYY 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAAGGAG 
EVI IV RVE I NGQDLKMDCKE 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAGAAA 
YNYDKS IVDSGTTNLRLPKK 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 
VFEAAVKS I KAASSTEKFPD 

GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 
GFWLGEQLVCWQAGTT PWN I 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTTCGCATCACC 
FPVI SLYLMGEVTNQS FRIT 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGT 
I LPQQYLRPVEDVATS QDDC 
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FIGURE 8 (2) 



TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATGGAG 
YKFAI SQSSTGTVMGAVIME 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 
GFYVVFDRARKRIGFAVSAC 

CATTAG 
H * 
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FIGURE 9 
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PIGURE 10 
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FIGURE 11 



MAOALPWLLLWMGAGVLPAHG TOHGIRLPLRSGIiGGAPLGLRLPRETDEE 
PEE PGRRGS FVEMVDNLRGKSGQGYYVEMTVGS PPQTLNI LVDTGS SNFA 
VGAAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPH 
GPNVTVRANIAAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDS 
LVKQTHVPNLFSLQLCGAGFPLNQSEVIiASVGGSMIIGGIDHSLYTGSLW 
YTP I RREWYYEVI IVRVE INGQDLKMDCKE YNYDKS IVDSGTTNLRLPKK 
VFEAAVKS IKAASSTEKFPDGFWLGEQLVCWQAGTTPWNI FPVISL YLMG 
EVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIME 
GFYVVFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQT 
DES 
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FIGURE 12 



MAQALPWLLLWMGAGVLPAHG TQHGIRLPLRSGLGGAPLGLRLPRETDEE 
PEEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFA 
VGAAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPH 
GPNVTVRJ^IAAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDS 
LVKQTHVPNLFS LQLCGAGFPLNQS E VLAS VGGSMI IGG IDHS L YTGS LW 
YTP IRREWYYEVI IVRVE INGQDLKMDCKE YNYDKS IVDSGTTNLRLPKK 
VFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMG 
EVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIME 
GFYWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQT 
DESHHHHHH 
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SEQUENCE LISTING 

<110> Pharmacia & Upjohn 

<120> ALZHEIMER'S DISEASE SECRETASE , APP SUBSTRATES THEREFOR, AND USES 
THEREFOR 

<130> 28341/6280.1 

<140> 
<141> 

<150> 60/169,232 
<151> 1999-12-06 

<150> 09/416,901 
<151> 1999-10-13 

<150> 60/155,493 
<151> 1999-09-23 

<150> 09/404,133 
<151> 1999-09-23 

<150> PCT/US99/20881 
<151> 1999-09-23 

<150> 60/101,594 
<151> 1998-09-24 

<160> 76 

<170> Patentln Ver. 2.0 

<210> 1 

<211> 1804 

<212> DNA 

<213> Homo sapiens 

<400> 1 

atgggcgcac tggcccgggc gctgctgctg cctctgctgg cccagtggct cctgcgcgcc 60 
gccccggagc tggcccccgc gcccttcacg ctgcccctcc gggtggccgc ggccacgaac 12 0 
cgcgtagttg cgcccacccc gggacccggg acccctgccg agcgccacgc cgacggcttg 180 
gcgctcgccc tggagcctgc cctggcgtcc cccgcgggcg ccgccaactt cttggccatg 240 
gtagacaacc tgcaggggga ctctggccgc ggctactacc tggagatgct gatcgggacc 30 0 
cccccgcaga agctacagat tctcgttgac actggaagca gtaactttgc cgtggcagga 360 
accccgcact cctacataga cacgtacttt gacacagaga ggtctagcac ataccgctcc 420 
aagggc'tttg acgtcacagt gaagtacaca caaggaagct ggacgggctt cgttggggaa 480 
gacctcgtca ccatccccaa aggcttcaat acttcttttc ttgtcaacat tgccactatt 540 
tttgaatcag agaatttctt tttgcctggg attaaatgga atggaatact tggcctagct 600 
tatgccacac ttgccaagcc atcaagttct ctggagacct tcttcgactc cctggtgaca 66 0 
caagcaaaca tccccaacgt tttctccatg cagatgtgtg gagccggctt gcccgttgct 720 
ggatctggga ccaacggagg tagtcttgtc ttgggtggaa ttgaaccaag tttgtataaa 780 
ggagacatct ggtatacccc tattaaggaa gagtggtact accagataga aattctgaaa 840 
ttggaaattg gaggccaaag ccttaatctg gactgcagag agtataacgc agacaaggcc 900 
atcgtggaca gtggcaccac gctgctgcgc ctgccccaga aggtgtttga tgcggtggtg 960 
gaagctgtgg cccgcgcatc tctgattcca gaattctctg atggtttctg gactgggtcc 1020 
cagctggcgt gctggacgaa ttcggaaaca ccttggtctt acttccctaa aatctccatc 1080 
tacctgagag atgagaactc cagcaggtca ttccgtatca caatcctgcc tcagctttac 1140 
attcagccca tgatgggggc cggcctgaat tatgaatgtt accgattcgg catttcccca 1200 
tccacaaatg cgctggtgat cggtgccacg gtgatggagg gcttctacgt catcttcgac 1260 
agagcccaga agagggtggg cttcgcagcg agcccctgtg cagaaattgc aggtgctgca 1320 
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2- 



gtgtctgaaa 
cagtctttga 
gccatcctcc 
cgtgaccctg 
gccaggcctg 
agcagccggg 
gctcccagat 
ctccctactt 
aaaa 



tttccgggcc 
gcgagcccat 
ttgtcttaat 
aggtcgtcaa 
acctcaagca 
atcgatggtg 
gccttctaga 
ccaagaaaaa 



tttctcaaca 
tttgtggatt 
cgtcctgctg 
tgatgagtcc 
accatgaact 
gcgctttctc 
ttcactgtct 
taattaaaaa 



gaggatgtag 
gtgtcctatg 
ctgctgccgt 
tctctggtca 
cagctattaa 
ctgtgcccac 
tttgattctt 
aaaaacttca 



ccagcaactg 
cgctcatgag 
tccggtgtca 
gacatcgctg 
gaaaatcaca 
ccgtcttcaa 
gattttcaag 
ttctaaacca 



tgtccccgct 
cgtctgtgga 
gcgtcgcccc 
gaaatgaata 
tttccagggc 
tctctgttct 
ctttcaaatc 
aaaaaaaaaa 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1804 



<210> 2 
<211> 518 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Gly Ala Leu Ala 
1 5 

Leu Leu Arg Ala Ala 
20 

Leu Arg Val Ala Ala 
35 

Pro Gly Thr Pro Ala 
50 

Glu Pro Ala Leu Ala 
65 

Val Asp Asn Leu Gin 
85 

Leu He Gly Thr Pro 
100 

Ser Ser Asn Phe Ala 
115 

Tyr Phe Asp Thr Glu 
13 0 

Val Thr Val Lys Tyr 
145 

Asp Leu Val Thr He 
165 



Arg Ala Leu 

Pro Glu Leu 

Ala Thr Asn 
40 

Glu Arg His 
55 

Ser Pro Ala 
70 

Gly Asp Ser 
Pro Gin Lys 



Val Ala Gly 
120 

Arg Ser Ser 
135 

Thr Gin Gly 
150 

Pro Lys Gly 



Leu Leu Pro 
10 

Ala Pro Ala 
25 

Arg val Val 
Ala Asp Gly 



Gly Ala Ala 
75 

Gly Arg Gly 
90 

Leu Gin He 
105 

Thr Pro His 



Thr Tyr Arg 



Ser Trp Thr 
155 

Phe Asn Thr 
170 



Leu Leu 



Pro Phe 



Ala Pro 
45 

Leu Ala 
60 

Asn Phe 



Tyr Tyr 

Leu Val 

Ser Tyr 
125 

Ser Lys 
140 

Gly Phe 
Ser Phe 



Ala Gin Trp 
15 

Thr Leu Pro 
30 

Thr Pro Gly 



Leu Ala Leu 



Leu Ala Met 
80 

Leu Glu Met 
95 

Asp Thr Gly 
110 

lie Asp Thr 



Gly Phe Asp 



Val Gly Glu 
160 

Leu Val Asn 
175 



He Ala Thr He Phe Glu Ser Glu 
180 

Trp Asn Gly He Leu Gly Leu Ala 

195 200 

Ser Ser Leu Glu Thr Phe Phe Asp 

210 215 



Asn Phe Phe Leu Pro Gly He Lys 
185 190 

Tyr Ala Thr Leu Ala Lys Pro Ser 
205 

Ser Leu Val Thr Gin Ala Asn He 
220 



Pro Asn Val Phe Ser Met Gin Met Cys Gly Ala Gly Leu Pro Val Ala 
225 230 235 240 
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Gly Ser Gly Thr Asn Gly Gly Ser Leu Val Leu Gly Gly lie Glu Pro 
' J 245 ' 250 ' 255 

Ser Leu Tyr Lys Gly Asp lie Trp Tyr Thr Pro lie Lys Glu Glu Trp 
260 265 270 

Tyr Tyr Gin lie Glu lie Leu Lys Leu Glu lie Gly Gly Gin Ser Leu 
275 280 285 

Asn Leu Asp Cys Arg Glu Tyr Asn Ala Asp Lys Ala lie Val Asp Ser 
290 295 300 

Gly Thr Thr Leu Leu Arg Leu Pro Gin Lys Val Phe Asp Ala Val Val 
305 310 315 320 

Glu Ala Val Ala Arg Ala Ser Leu lie Pro Glu Phe Ser Asp Gly Phe 
325 330 335 

Trp Thr Gly Ser Gin Leu Ala Cys Trp Thr Asn Ser Glu Thr Pro Trp 
340 345 350 

Ser Tyr Phe Pro Lys lie Ser lie Tyr Leu Arg Asp Glu Asn Ser Ser 
355 360 365 

Arg Ser Phe Arg lie Thr lie Leu Pro Gin Leu Tyr lie Gin Pro Met 
370 375 380 

Met Gly Ala Gly Leu Asn Tyr Glu Cys Tyr Arg Phe Gly lie Ser Pro 
385 390 395 400 

Ser Thr Asn Ala Leu Val lie Gly Ala Thr Val Met Glu Gly Phe Tyr 
405 410 415 

Val lie Phe Asp Arg Ala Gin Lys Arg Val Gly Phe Ala Ala Ser Pro 
420 425 430 

Cys Ala Glu He Ala Gly Ala Ala Val Ser Glu He Ser Gly Pro Phe 
435 440 445 

Ser Thr Glu Asp Val Ala Ser Asn Cys Val Pro Ala Gin Ser Leu Ser 
450 455 460 



Glu Pro He Leu Trp He Val Ser Tyr Ala Leu Met Ser Val Cys Gly 
465 470 475 480 

Ala He Leu Leu Val Leu He Val Leu Leu Leu Leu Pro Phe Arg Cys 
485 490 495 

Gin Arg Arg Pro Arg Asp Pro Glu Val Val Asn Asp Glu Ser Ser Leu 
500 505 510 

Val Arg His Arg Trp Lys 
515 



<210> 3 

<211> 2070 

<212> DNA 

<213> Homo sapiens 

<400> 3 

atggcccaag ccctgccctg gctcctgctg tggatgggcg cgggagtgct gcctgcccac 60 
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ggcacccagc 
ctgcggctgc 
gtggagatgg 
gtgggcagcc 
gtgggtgctg 
taccgggacc 
ctgggcaccg 
gctgccatca 
gggctggcct 
ctggtaaagc 
cccctcaacc 
gaccactcgc 
gaggtcatca 
tacaactatg 
gtgtttgaag 
ggtttctggc 
ttcccagtca 
atccttccgc 
tacaagtttg 
ggcttctacg 
catgtgcacg 
gaagactgtg 
gtcatggctg 
cgctgcctcc 
aagtgaggag 
ctttggtcac 
ccacccacca 
ggactgtacc 
cttggtcacc 
ttgtccacca 
gtactggcat 
agaccaagct 
tgctttagag 
attaaaaaaa 



acggcatccg 
cccgggagac 
tggacaacct 
ccccgcagac 
ccccccaccc 
tccggaaggg 
acctggtaag 
ctgaatcaga 
atgctgagat 
agacccacgt 
agtctgaagt 
tgtacacagg 
ttgtgcgggt 
acaagagcat 
ctgcagtcaa 
taggagagca 
tctcactcta 
agcaatacct 
ccatctcaca 
ttgtctttga 
atgagttcag 
gctacaacat 
ccatctgcgc 
gctgcctgcg 
gcccatgggc 
aagtaggaga 
aatgcctctg 
tgtaggaaac 
tcaaatttaa 
ttcctttaaa 
cacacgcagg 
tgtttccctg 
acagggactg 
aaaaaaaaaa 



gctgcccctg 
cgacgaagag 
gaggggcaag 
gctcaacatc 
cttcctgcat 
tgtgtatgtg 
catcccccat 
caagttcttc 
tgccaggcct 
tcccaacctc 
gctggcctct 
cagtctctgg 
ggagatcaat 
tgtggacagt 
atccatcaag 
gctggtgtgc 
cctaatgggt 
gcggccagtg 
gtcatccacg 
tcgggcccga 
gacggcagcg 
tccacagaca 
cctcttcatg 
ccagcagcat 
agaagataga 
cacagatggc 
ccttgatgga 
agaaaagaga 
gtcgggaaat 
ttctccaacc 
ttaccttggc 
ctggccaaag 
tataaacaag 
aaaaaaaaaa 



cgcagcggcc 
cccgaggagc 
tcggggcagg 
ctggtggata 
cgctactacc 
ccctacaccc 
ggccccaacg 
atcaacggct 
gacgactccc 
ttctccctgc 
gtcggaggga 
tatacaccca 
ggacaggatc 
ggcaccacca 
gcagcctcct 
tggcaagcag 
gaggttacca 
gaagatgtgg 
ggcactgtta 
aaacgaattg 
gtggaaggcc 
gatgagtcaa 
ctgccactct 
gatgactttg 
gattcccctg 
acctgtggcc 
gaaggaaaag 
agaaagaagc 
tctgctgctt 
caaagtattc 
gtgtgtccct 
tcagtaggag 
cctaacattg 



tggggggcgc 
ccggccggag 
gctactacgt 
caggcagcag 
agaggcagct 
agggcaagtg 
tcactgtgcg 
ccaactggga 
tggagccttt 
acctttgtgg 
gcatgatcat 
tccggcggga 
tgaaaatgga 
accttcgttt 
ccacggagaa 
gcaccacccc 
accagtcctt 
ccacgtccca 
tgggagctgt 
gctttgctgt 
cttttgtcac 
ccctcatgac 
gcctcatggt 
ctgatgacat 
gaccacacct 
agagcacctc 
gctggcaagg 
actctgctgg 
gaaacttcag 
ttcttttctt 
gtggtaccct 
aggatgcaca 
gtgcaaagat 



ccccctgggg 
gggcagcttt 
ggagatgacc 
taactttgca 
gtccagcaca 
ggaaggggag 
tgccaacatt 
aggcatcctg 
ctttgactct 
tgctggcttc 
tggaggtatc 
gtggtattat 
ctgcaaggag 
gcccaagaaa 
gttccctgat 
ttggaacatt 
ccgcatcacc 
agacgactgt 
tatcatggag 
cagcgcttgc 
cttggacatg 
catagcctat 
gtgtcagtgg 
ctccctgctg 
ccgtggttca 
aggaccctcc 
tgggttccag 
cgggaatact 
ccctgaacct 
agtttcagaa 
ggcagagaag 
gtttgctatt 
tgcctcttga 



120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2070 



<210> 4 

<211> 501 

<212> PRT 

<213> Homo sapiens 



<400> 4 

Met Ala Gin Ala Leu 
1 5 

Leu Pro Ala His Gly 
20 

Gly Leu Gly Gly Ala 
35 



Pro Trp Leu Leu 
Thr Gin 



Pro Leu 



His Gly 
25 

Gly Leu 
40 



Leu Trp Met 
10 

lie Arg Leu 
Arg Leu Pro 



Gly Ala Gly Val 
15 

Pro Leu Arg Ser 
30 

Arg Glu Thr Asp 
45 



Glu Glu Pro Glu Glu Pro Gly Arg Arg 

50 55 

Asp Asn Leu Arg Gly Lys Ser Gly Gin 

65 70 



Gly Ser Phe 
60 

Gly Tyr Tyr 
75 



Val Glu Met Val 



Val Glu Met Thr 
80 



Val Gly Ser Pro Pro 
85 



Gin Thr Leu Asn 



He Leu val 
90 



Asp Thr Gly Ser 
95 



Ser Asn Phe Ala Val Gly Ala Ala Pro 
100 105 



His Pro Phe 



Leu His Arg Tyr 
110 
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Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val 
115 120 125 

Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp 
130 ' 135 140 

Leu Val Ser He Pro His Gly Pro Asn Val Thr Val Arg Ala Asn He 
145 150 155 160 

Ala Ala He Thr Glu Ser Asp Lys Phe Phe lie Asn Gly Ser Asn Trp 
165 170 175 

Glu Gly He Leu Gly Leu Ala Tyr Ala Glu He Ala Arg Pro Asp Asp 
180 185 190 

Ser Leu Glu Pro Phe Phe Asp Ser Leu Val Lys Gin Thr His Val Pro 
195 200 205 

Asn Leu Phe Ser Leu His Leu Cys Gly Ala Gly Phe Pro Leu Asn Gin 
210 215 220 

Ser Glu Val Leu Ala Ser Val Gly Gly Ser Met He He Gly Gly He 
225 230 235 240 

Asp His Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr Pro He Arg Arg 
245 250 255 

Glu Trp Tyr Tyr Glu Val He He Val Arg Val Glu He Asn Gly Gin 
260 265 270 

Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp Lys Ser He Val 
275 280 285 

Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys Val Phe Glu Ala 
290 295 300 

Ala Val Lys Ser He Lys Ala Ala Ser Ser Thr Glu Lys Phe Pro Asp 
305 310 315 320 

Gly Phe Trp Leu Gly Glu Gin Leu Val Cys Trp Gin Ala Gly Thr Thr 
325 330 335 

Pro Trp Asn He Phe Pro Val He Ser Leu Tyr Leu Met Gly Glu Val 
340 345 350 

Thr Asn Gin Ser Phe Arg He Thr He Leu Pro Gin Gin Tyr Leu Arg 
355 360 365 

Pro Val Glu Asp Val Ala Thr Ser Gin Asp Asp Cys Tyr Lys Phe Ala 
370 375 380 

He Ser Gin Ser Ser Thr Gly Thr Val Met Gly Ala Val He Met Glu 
385 390 395 400 

Gly Phe Tyr Val Val Phe Asp Arg Ala Arg Lys Arg He Gly Phe Ala 
405 410 415 

Val Ser Ala Cys His Val His Asp Glu Phe Arg Thr Ala Ala Val Glu 
420 425 430 

Gly Pro Phe Val Thr Leu Asp Met Glu Asp Cys Gly Tyr Asn He Pro 
435 440 445 
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Gln Thr Asp Glu Ser 
450 

lie Cys Ala Leu Phe 
465 

Arg Cys Leu Arg Cys 
485 



Thr Leu Met Thr He Ala 
455 

Met Leu Pro Leu Cys Leu 
470 475 

Leu Arg Gin Gin His Asp 
490 



Tyr Val Met Ala Ala 
460 

Met Val Cys Gin Trp 
480 

Asp Phe Ala Asp Asp 
495 



He Ser Leu Leu Lys 
500 



<210> 5 

<211> 1977 

<212> DNA 

<213 > Homo sapiens 



<400> 5 

atggcccaag ccctgccctg gctcctgctg tggatgggcg cgggagtgct gcctgcccac 60 

ggcacccagc acggcatccg gctgcccctg cgcagcggcc tggggggcgc ccccctgggg 120 

ctgcggctgc cccgggagac cgacgaagag cccgaggagc ccggccggag gggcagcttt 180 

gtggagatgg tggacaacct gaggggcaag tcggggcagg gctactacgt ggagatgacc 240 

gtgggcagcc ccccgcagac gctcaacatc ctggtggata caggcagcag taactttgca 300 

gtgggtgctg ccccccaccc cttcctgcat cgctactacc agaggcagct gtccagcaca 360 

taccgggacc tccggaaggg tgtgtatgtg ccctacaccc agggcaagtg ggaaggggag 420 

ctgggcaccg acctggtaag catcccccat ggccccaacg tcactgtgcg tgccaacatt 480 

gctgccatca ctgaatcaga caagttcttc atcaacggct ccaactggga aggcatcctg 540 

gggctggcct atgctgagat tgccaggctt tgtggtgctg gcttccccct caaccagtct 600 

gaagtgctgg cctctgtcgg agggagcatg atcattggag gtatcgacca ctcgctgtac 660 

acaggcagtc tctggtatac acccatccgg cgggagtggt attatgaggt gatcattgtg 720 

cgggtggaga tcaatggaca ggatctgaaa atggactgca aggagtacaa ctatgacaag 7 80 

agcattgtgg acagtggcac caccaacctt cgtttgccca agaaagtgtt tgaagctgca 840 

gtcaaatcca tcaaggcagc ctcctccacg gagaagttcc ctgatggttt ctggctagga 900 

gagcagctgg tgtgctggca agcaggcacc accccttgga acattttccc agtcatctca 960 

ctctacctaa tgggtgaggt taccaaccag tccttccgca tcaccatcct tccgcagcaa 1020 

tacctgcggc cagtggaaga tgtggccacg tcccaagacg actgttacaa gtttgccatc 1080 

tcacagtcat ccacgggcac tgttatggga gctgttatca tggagggctt ctacgttgtc 1140 

tttgatcggg cccgaaaacg aattggcttt gctgtcagcg cttgccatgt gcacgatgag 1200 

ttcaggacgg cagcggtgga aggccctttt gtcaccttgg acatggaaga ctgtggctac 1260 

aacattccac agacagatga gtcaaccctc atgaccatag cctatgtcat ggctgccatc 1320 

tgcgccctct tcatgctgcc actctgcctc atggtgtgtc agtggcgctg cctccgctgc 1380 

ctgcgccagc agcatgatga ctttgctgat gacatctccc tgctgaagtg aggaggccca 1440 

tgggcagaag atagagattc ccctggacca cacctccgtg gttcactttg gtcacaagta 1500 

ggagacacag atggcacctg tggccagagc acctcaggac cctccccacc caccaaatgc 1560 

ctctgccttg atggagaagg aaaaggctgg caaggtgggt tccagggact gtacctgtag 1620 

gaaacagaaa agagaagaaa gaagcactct gctggcggga atactcttgg tcacctcaaa 1680 

tttaagtcgg gaaattctgc tgcttgaaac ttcagccctg aacctttgtc caccattcct 1740 

ttaaattctc caacccaaag tattcttctt ttcttagttt cagaagtact ggcatcacac 1800 

gcaggttacc ttggcgtgtg tccctgtggt accctggcag agaagagacc aagcttgttt 1860 

ccctgctggc caaagtcagt aggagaggat gcacagtttg ctatttgctt tagagacagg 1920 

gactgtataa acaagcctaa cattggtgca aagattgcct cttgaaaaaa aaaaaaa 1977 



<210> 6 

<211> 476 

<212> PRT 

<213 > Homo sapiens 



<400> 6 

Met Ala Gin Ala Leu Pro Trp Leu Leu Leu Trp 
15 10 



Met Gly Ala Gly Val 
15 
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Leu Pro Ala His Gly Thr Gin His Gly lie Arg Leu Pro Leu Arg Ser 
20 25 30 

Gly Leu Gly Gly Ala Pro Leu Gly Leu Arg Leu Pro Arg Glu Thr Asp 
35 40 45 

Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly Ser Phe Val Glu Met Val 
50 55 60 

Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly Tyr Tyr Val Glu Met Thr 
65 70 75 80 

Val Gly Ser Pro Pro Gin Thr Leu Asn lie Leu Val Asp Thr Gly Ser 
85 90 95 

Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe Leu His Arg Tyr 
100 105 110 

Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val 
115 120 125 

Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp 
130 135 140 

Leu Val Ser lie Pro His Gly Pro Asn Val Thr Val Arg Ala Asn lie 
145 150 155 160 

Ala Ala lie Thr Glu Ser Asp Lys Phe Phe lie Asn Gly Ser Asn Trp 
165 170 175 

Glu Gly lie Leu Gly Leu Ala Tyr Ala Glu lie Ala Arg Leu Cys Gly 
180 185 190 

Ala Gly Phe Pro Leu Asn Gin Ser Glu Val Leu Ala Ser Val Gly Gly 
195 200 205 

Ser Met lie lie Gly Gly lie Asp His Ser Leu Tyr Thr Gly Ser Leu 
210 215 220 

Trp Tyr Thr Pro lie Arg Arg Glu Trp Tyr Tyr Glu Val lie lie Val 
225 230 235 240 

Arg Val Glu lie Asn Gly Gin Asp Leu Lys Met Asp Cys Lys Glu Tyr 
245 250 255 

Asn Tyr Asp Lys Ser lie Val Asp Ser Gly Thr Thr Asn Leu Arg Leu 
260 265 270 

Pro Lys Lys Val Phe Glu Ala Ala Val Lys Ser lie Lys Ala Ala Ser 
275 280 285 

Ser Thr Glu Lys Phe Pro Asp Gly Phe Trp Leu Gly Glu Gin Leu Val 
290 295 300 

Cys Trp Gin Ala Gly Thr Thr Pro Trp Asn He Phe Pro Val He Ser 
305 310 315 320 

Leu Tyr Leu Met Gly Glu Val Thr Asn Gin Ser Phe Arg He Thr He 
325 330 335 

Leu Pro Gin Gin Tyr Leu Arg Pro Val Glu Asp Val Ala Thr Ser Gin 
340 345 350 
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Asp Asp Cys Tyr Lys Phe Ala lie Ser Gin Ser Ser Thr Gly Thr Val 
355 360 365 



Met Gly Ala Val 
370 

Arg Lys Arg lie 
385 

Phe Arg Thr Ala 



Asp Cys Gly Tyr 
420 

He Ala Tyr Val 
435 

Cys Leu Met Val 
450 

His Asp Asp Phe 
465 



He Met Glu Gly 
375 

Gly Phe Ala Val 
390 

Ala Val Glu Gly 
405 

Asn He Pro Gin 



Met Ala Ala He 
440 

Cys Gin Trp Arg 
455 

Ala Asp Asp He 
470 



Phe Tyr Val Val 
380 

Ser Ala Cys His 
395 

Pro Phe Val Thr 
410 

Thr Asp Glu Ser 
425 

Cys Ala Leu Phe 



Cys Leu Arg Cys 
460 

Ser Leu Leu Lys 
475 



Phe Asp Arg Ala 



Val His Asp Glu 
400 

Leu Asp Met Glu 
415 

Thr Leu Met Thr 
430 

Met Leu Pro Leu 
445 

Leu Arg Gin Gin 



<210> 7 
<211> 2043 
<212> DNA 

<213> Mus musculus 



<400> 7 

atggccccag 

ggaacccatc 

ctgaggctgc 

gtggagatgg 

gtaggcagcc 

gtgggggctg 

tatcgagacc 

ctgggcaccg 

gctgccatca 

gggctggcct 

ctggtgaagc 

cccctcaacc 

gaccactcgc 

gaagtgatca 

tacaactacg 

gtatttgaag 

ggcttttggc 

ttcccagtca 

atccttcctc 

tacaagt teg 

ggtttctatg 

catgtgcacg 

gaagactgtg 

gtcatggcgg 

cgctgcctgc 

aagtaaggag 

ctttggtcac 

caccaacctg 

gggcttgeae 

ccttaggcac 

ctgcccagca 



cgctgcactg 
tcggcatccg 
cccgggagac 
tggacaacct 
ccccacagac 
ccccacaccc 
tccgaaaggg 
acctggtgag 
ctgaatcgga 
atgetgagat 
agacccacat 
agaccgaggc 
tatacaeggg 
ttgtacgtgt 
acaagagcat 
ctgccgtcaa 
taggggagca 
tttcacttta 
agcaatacct 
ctgtctcaca 
tegtcttega 
atgagttcag 
gctacaacat 
ccatctgcgc 
gttgcctgcg 
gctcgtgggc 
atgagttgga 
ccaatgcttc 
ctgtaggaca 
cacaaacttg 
tcctttagag 



gctcctgcta 
gctgcccctt 
tgacgaggaa 
gaggggaaag 
gctcaacatc 
tttcctgeat 
tgtgtatgtg 
catccctcat 
caagttcttc 
tgccaggccc 
tcccaacatc 
actggcctcg 
cagtctctgg 
ggaaatcaat 
tgtggacagt 
gtccatcaag 
gctggtgtgc 
cctcatgggt 
aeggceggtg 
gtcatccacg 
tcgagcccga 
gaeggeggea 
tccccagaca 
cctcttcatg 
ccaccagcac 
agatgatgga 
gctatggatg 
tggcgtgaca 
caggagaggg 
agttggaaat 
tctccaacct 



tgggtgggct 
cgcagcggcc 
teggaggage 
tccggccagg 
ctggtggaca 
cgctactacc 
ccctacaccc 
ggccccaacg 
atcaatggtt 
gacgactctt 
ttttccctgc 
gtgggaggga 
tacacaccca 
ggtcaagatc 
gggaccacca 
gcagcctcct 
tggcaagcag 
gaagtcacca 
gaggacgtgg 
ggcactgtta 
aagcgaattg 
gtggaaggtc 
gatgagtcaa 
ttgccactct 
gatgactttg 
gacgcccctg 
gtacctgtgg 
gaacagagaa 
aaggaagcag 
tttgetgett 
aaagtattct 



egggaatget 
tggcagggee 
ctggccggag 
gctactatgt 
egggcagtag 
agaggcagct 
agggcaagtg 
tcactgtgcg 
ccaactggga 
tggagecett 
agctctgtgg 
gcatgatcat 
teeggeggga 
tcaagatgga 
accttcgctt 
egaeggagaa 
gcacgacccc 
atcagtcctt 
ccacgtccca 
tgggagccgt 
getttgetgt 
cgtttgttac 
cacttatgac 
gcctcatggt 
ctgatgacat 
gaccacatct 
ccagagcacc 
atcaggcaag 
cgttctggtg 
gaagcttcag 
ttatgtcctt 



gcctgcccag 
acccctgggc 
aggcagcttt 
ggagatgacc 
taactttgea 
gtccagcaca 
ggagggggaa 
tgecaacatt 
gggcatccta 
ctttgactcc 
cgctggcttc 
tggtggtatc 
gtggtattat 
ctgeaaggag 
geccaagaaa 
gttcceggat 
t tggaacatt 
ccgcatcacc 
agacgactgt 
catcatggaa 
cagcgcttgc 
ggcagacatg 
catagectat 
atgtcagtgg 
ctccctgctc 
gggtggttcc 
tcaggaccct 
ctggat taca 
gcaggaatat 
ccctgaccct 
ccagaagtac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 
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tggcgtcata ctcaggctac ccggcatgtg tccctgtggt accctggcag agaaagggcc 1920 
aatctcattc cctgctggcc aaagtcagca gaagaaggtg aagtttgcca gttgctttag 1980 
tgatagggac tgcagactca agcctacact ggtacaaaga ctgcgtcttg agataaacaa 2040 
gaa 2043 

<210> 8 
<211> 501 
<212> PRT 

<213> Mus musculus 
<400> 8 

Met Ala Pro Ala Leu His Trp Leu Leu Leu Trp Val Gly Ser Gly Met 
1 5 10 15 

Leu Pro Ala Gin Gly Thr His Leu Gly lie Arg Leu Pro Leu Arg Ser 
20 25 30 

Gly Leu Ala Gly Pro Pro Leu Gly Leu Arg Leu Pro Arg Glu Thr Asp 
35 40 45 

Glu Glu Ser Glu Glu Pro Gly Arg Arg Gly Ser Phe Val Glu Met Val 
50 55 60 

Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly Tyr Tyr Val Glu Met Thr 
65 " 70 75 8 0 

Val Gly Ser Pro Pro Gin Thr Leu Asn lie Leu Val Asp Thr Gly Ser 
85 90 ~ 95 

Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe Leu His Arg Tyr 
100 105 110 

Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val 
115 120 " 125 

Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp 
130 135 140 

Leu Val Ser lie Pro His Gly Pro Asn Val Thr Val Arg Ala Asn lie 
145 150 155 160 

Ala Ala lie Thr Glu Ser Asp Lys Phe Phe lie Asn Gly Ser Asn Trp 
165 170 175 

Glu Gly He Leu Gly Leu Ala Tyr Ala Glu He Ala Arg Pro Asp Asp 
180 185 190 

Ser Leu Glu Pro Phe Phe Asp Ser Leu Val Lys Gin Thr His He Pro 
195 200 205 

Asn He Phe Ser Leu Gin Leu Cys Gly Ala Gly Phe Pro Leu Asn Gin 
210 215 220 

Thr Glu Ala Leu Ala Ser Val Gly Gly Ser Met He He Gly Gly He 
225 230 235 240 

Asp His Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr Pro He Arg Arg 
245 250 255 

Glu Trp Tyr Tyr Glu Val He He Val Arg Val Glu He Asn Gly Gin 
260 265 270 
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Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp Lys Ser lie Val 
275 * 280 285 

Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys Val Phe Glu Ala 
290 295 300 

Ala Val Lys Ser lie Lys Ala Ala Ser Ser Thr Glu Lys Phe Pro Asp 
305 310 315 320 

Gly Phe Trp Leu Gly Glu Gin Leu Val Cys Trp Gin Ala Gly Thr Thr 
325 330 335 

Pro Trp Asn lie Phe Pro Val lie Ser Leu Tyr Leu Met Gly Glu Val 
340 345 350 

Thr Asn Gin Ser Phe Arg lie Thr lie Leu Pro Gin Gin Tyr Leu Arg 
355 360 365 

Pro Val Glu Asp Val Ala Thr Ser Gin Asp Asp Cys Tyr Lys Phe Ala 
370 375 380 

Val Ser Gin Ser Ser Thr Gly Thr Val Met Gly Ala Val lie Met Glu 
385 390 395 400 

Gly Phe Tyr Val Val Phe Asp Arg Ala Arg Lys Arg lie Gly Phe Ala 
4 05 410 415 

Val Ser Ala Cys His Val His Asp Glu Phe Arg Thr Ala Ala Val Glu 
420 425 430 

Gly Pro Phe Val Thr Ala Asp Met Glu Asp Cys Gly Tyr Asn lie Pro 
435 440 445 

Gin Thr Asp Glu Ser Thr Leu Met Thr He Ala Tyr Val Met Ala Ala 
450 455 460 

He Cys Ala Leu Phe Met Leu Pro Leu Cys Leu Met Val Cys Gin Trp 
465 470 475 480 

Arg Cys Leu Arg Cys Leu Arg His Gin His Asp Asp Phe Ala Asp Asp 
485 490 495 



He Ser Leu Leu Lys 
500 



<210> 9 

<211> 2088 

<212> DNA 

<213> Homo sapiens 



<400> 9 

atgctgcccg 

cccactgatg 

ctgaacatgc 

acctgcattg 

cagatcacca 

ggccgcaagc 

gagtttgtaa 

atggatgttt 

aagagtacca 

ggggtagagt 



gtttggcact 
gtaatgctgg 
acatgaatgt 
ataccaagga 
atgtggtaga 
agtgcaagac 
gtgatgccct 
gcgaaactca 
acttgcatga 
ttgtgtgttg 



gctcctgctg 
cctgctggct 
ccagaatggg 
aggcatcctg 
agccaaccaa 
ccatccccac 
tctcgttcct 
tcttcactgg 
ctacggcatg 
cccactggct 



gccgcctgga 
gaaccccaga 
aagtgggatt 
cagtattgcc 
ccagtgacca 
tttgtgattc 
gacaagtgca 
cacaccgtcg 
ttgctgccct 
gaagaaagtg 



cggctcgggc 
ttgccatgtt 
cagatccatc 
aagaagtcta 
tccagaactg 
cctaccgctg 
aattcttaca 
ccaaagagac 
gcggaattga 
acaatgtgga 



gctggaggta 
ctgtggcaga 
agggaccaaa 
ccctgaactg 
gtgcaagcgg 
cttagttggt 
ccaggagagg 
atgcagtgag 
caagttccga 
ttctgctgat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 
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gcggaggagg 
agtgaagaca 
gaagccgatg 
ccctacgaag 
gagtctgtgg 
gacaagtatc 
gagaggcttg 
gcagaacgtc 
caggagaaag 
acacacatgg 
tacatcaccg 
aagtatgtcc 
cgcatggtgg 
gtgatttatg 
gaggagattc 
gtcttggcca 
tctttgaccg 
gacgatctcc 
gaagttgagc 
tctgggttga 
cgacatgact 
ggttcaaaca 
atcgtcatca 
gtggaggttg 
ggctacgaaa 



atgactcgga 
aagtagtaga 
atgacgagga 
aagccacaga 
aagaggtggt 
tcgagacacc 
aggccaagca 
aagcaaagaa 
tggaatcttt 
ccagagtgga 
ctctgcaggc 
gcgcagaaca 
atcccaagaa 
agcgcatgaa 
aggatgaagt 
acatgattag 
aaacgaaaac 
agccgtggca 
ctgttgatgc 
caaatatcaa 
caggatatga 
aaggtgcaat 
ccttggtgat 
acgccgctgt 
atccaaccta 



tgtctggtgg 
agtagcagag 
cgatgaggat 
gagaaccacc 
tcgagttcct 
tggggatgag 
ccgagagaga 
cttgcctaaa 
ggaacaggaa 
agccatgctc 
tgttcctcct 
gaaggacaga 
agccgctcag 
tcagtctctc 
tgatgagctg 
tgaaccaagg 
caccgtggag 
ttcttttggg 
ccgccctgct 
gacggaggag 
agttcatcat 
cattggactc 
gctgaagaag 
caccccagag 
caagttcttt 



ggcggagcag 
gaggaagaag 
ggtgatgagg 
agcattgcca 
acaacagcag 
aatgaacatg 
atgtcccagg 
gctgataaga 
gcagccaacg 
aatgaccgcc 
cggcctcgtc 
cagcacaccc 
atccggtccc 
tccctgctct 
cttcagaaag 
atcagttacg 
ctccttcccg 
gctgactctg 
gccgaccgag 
atctctgaag 
caaaaattgg 
atggtgggcg 
aaacagtaca 
gagcgccacc 
gagcagatgc 



acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
ccagtacccc 
cccatttcca 
tcatgagaga 
aggcagttat 
agagacagca 
gccgcctggc 
acgtgttcaa 
taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 
gaaacgatgc 
tgaatggaga 
tgccagccaa 
gactgaccac 
tgaagatgga 
tgttctttgc 
gtgttgtcat 
catccattca 
tgtccaagat 
agaactag 



tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
tgatgccgtt 
gaaagccaaa 
atgggaagag 
ccagcatttc 
gctggtggag 
cctggagaac 
tatgctaaag 
cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 
tctcatgcca 
gttcagcctg 
cacagaaaac 
tcgaccaggt 
tgcagaattc 
agaagatgtg 
agcgacagtg 
tcatggtgtg 
gcagcagaac 



660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2088 



<210> 10 
<211> 695 
<212> PRT 

<213> Homo sapiens 



<400> 10 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala 
15 10 



Ala Trp Thr Ala Arg 
15 



Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin He Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 
50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 



Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 
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Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly lie 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 295 " 300 



Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 
325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 

Lys Lys Ala Val lie Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 380 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 

Tyr lie Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 
405 410 "** 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 

Ala Gin lie Arg Ser Gin Val Met Thr His Leu Arg Val lie Tyr Glu 
450 455 460 

Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu lie Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
485 490 495 
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Tyr Ser Asp Asp 
500 

Tyr Gly Asn Asp 
515 

Val Glu Leu Leu 
530 

Pro Trp His Ser 
545 

Glu Val Glu Pro 



Thr Arg Pro Gly 
580 



Val Leu Ala Asn 



Ala Leu Met Pro 
520 

Pro Val Asn Gly 
535 

Phe Gly Ala Asp 
550 

Val Asp Ala Arg 
565 

Ser Gly Leu Thr 



- 13 - 

Met lie Ser Glu 
505 

Ser Leu Thr Glu 



Glu Phe Ser Leu 
540 

Ser Val Pro Ala 
555 

Pro Ala Ala Asp 
570 

Asn lie Lys Thr 
585 



Pro Arg lie Ser 
510 

Thr Lys Thr Thr 
525 

Asp Asp Leu Gin 



Asn Thr Glu Asn 
560 

Arg Gly Leu Thr 
575 

Glu Glu lie Ser 
590 



Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 
595 600 605 

His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 
610 615 620 

Gly Ala He He Gly Leu Met Val Gly Gly Val Val lie Ala Thr Val 
625 630 635 640 

He Val He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 
645 650 655 

.His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 
660 665 670 

His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 
675 680 685 



Phe Phe 
690 



Glu Gin Met Gin Asn 
695 



<210> 11 

<211> 2088 

<212> DNA 

<213> Homo sapiens 



<400> 11 

atgctgcccg 

cccactgatg 

ctgaacatgc 

acctgcattg 

cagatcacca 

ggccgcaagc 

gagtttgtaa 

atggatgttt 

aagagtacca 

ggggtagagt 

gcggaggagg 
agtgaagaca 
gaagccgatg 
ccctacgaag 
gagtctgtgg 
gacaagtatc 



gtttggcact 
gtaatgctgg 
acatgaatgt 
ataccaagga 
atgtggtaga 
agtgcaagac 
gtgatgccct 
gcgaaactca 
acttgcatga 
ttgtgtgttg 
atgactcgga 
aagtagtaga 
atgacgagga 
aagccacaga 
aagaggtggt 
tcgagacacc 



gctcctgctg 
cctgctggct 
ccagaatggg 
aggcatcctg 
agccaaccaa 
ccatccccac 
tctcgttcct 
tcttcactgg 
ctacggcatg 
cccactggct 
tgtctggtgg 
agtagcagag 
cgatgaggat 
gagaaccacc 
tcgagttcct 
tggggatgag 



gccgcctgga 
gaaccccaga 
aagtgggatt 
cagtattgcc 
ccagtgacca 
tttgtgattc 
gacaagtgca 
cacaccgtcg 
ttgctgccct 
gaagaaagtg 
ggcggagcag 
gaggaagaag 
ggtgatgagg 
agcattgcca 
acaacagcag 
aatgaacatg 



cggctcgggc 
ttgccatgtt 
cagatccatc 
aagaagtcta 
tccagaactg 
cctaccgctg 
aattcttaca 
ccaaagagac 
gcggaattga 
acaatgtgga 
acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
ccagtacccc 
cccatttcca 



gctggaggta 
ctgtggcaga 
agggaccaaa 
ccctgaactg 
gtgcaagcgg 
cttagttggt 
ccaggagagg 
atgcagtgag 
caagttccga 
ttctgctgat 
tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
tgatgccgtt 
gaaagccaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 
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gagaggcttg 
gcagaacgtc 
caggagaaag 
acacacatgg 
tacatcaccg 
aagtatgtcc 
cgcatggtgg 
gtgatttatg 
gaggagattc 
gtcttggcca 
tctttgaccg 
gacgatctcc 
gaagttgagc 
tctgggttga 
cgacatgact 
ggttcaaaca 
atcgtcatca 
gtggaggttg 
ggctacgaaa 



aggccaagca 
aagcaaagaa 
tggaatcttt 
ccagagtgga 
ctctgcaggc 
gcgcagaaca 
atcccaagaa 
agcgcatgaa 
aggatgaagt 
acatgattag 
aaacgaaaac 
agccgtggca 
ctgttgatgc 
caaatatcaa 
caggatatga 
aaggtgcaat 
ccttggtgat 
acgccgctgt 
atccaaccta 



ccgagagaga 
cttgcctaaa 
ggaacaggaa 
agccatgctc 
tgttcctcct 
gaaggacaga 
agccgctcag 
tcagtctctc 
tgatgagctg 
tgaaccaagg 
caccgtggag 
ttcttttggg 
ccgccctgct 
gacggaggag 
agttcatcat 
cattggactc 
gctgaagaag 
caccccagag 
caagttcttt 



atgtcccagg 
gctgataaga 
gcagccaacg 
aatgaccgcc 
cggcctcgtc 
cagcacaccc 
atccggtccc 
tccctgctct 
cttcagaaag 
atcagttacg 
ctccttcccg 
gctgactctg 
gccgaccgag 
atctctgaag 
caaaaattgg 
atggtgggcg 
aaacagtaca 
gagcgccacc 
gagcagatgc 



tcatgagaga 
aggcagttat 
agagacagca 
gccgcctggc 
acgtgttcaa 
taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 
gaaacgatgc 
tgaatggaga 
tgccagccaa 
gactgaccac 
tgaatctgga 
tgttctttgc 
gtgttgtcat 
catccattca 
tgtccaagat 
agaactag 



atgggaagag 
ccagcatttc 
gctggtggag 
cctggagaac 
tatgctaaag 
cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 
tctcatgcca 
gttcagcctg 
cacagaaaac 
tcgaccaggt 
tgcagaattc 
agaagatgtg 
agcgacagtg 
tcatggtgtg 
gcagcagaac 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2088 



<210> 12 
<211> 695 
<212> PRT 

<213> Homo sapiens 
<400> 12 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
15 10 15 



Ala Leu 



Gin He 



Asn Gly 
50 

Thr Lys 
65 

Gin He 



Trp Cys 

He Pro 

Val Pro 
130 

Glu Thr 
145 

Lys Ser 
Asp Lys 



Glu Val Pro 
20 

Ala Met Phe 
35 

Lys Trp Asp 

Glu Gly He 

Thr Asn Val 
85 

Lys Arg Gly 
100 

Tyr Arg Cys 
115 

Asp Lys Cys 
His Leu His 



Thr Asn Leu 
165 

Phe Arg Gly 
180 



Thr Asp 

Cys Gly 

Ser Asp 
55 

Leu Gin 
70 

Val Glu 

Arg Lys 

Leu Val 

Lys Phe 
135 

Trp His 
150 

His Asp 
Val Glu 



Gly Asn Ala Gly 
25 

Arg Leu Asn Met 
40 

Pro Ser Gly Thr 



Tyr Cys Gin Glu 
75 

Ala Asn Gin Pro 
90 

Gin Cys Lys Thr 
105 

Gly Glu Phe Val 
120 

Leu His Gin Glu 



Thr Val Ala Lys 
155 

Tyr Gly Met Leu 
170 

Phe Val Cys Cys 
185 



Leu Leu Ala 
30 

His Met Asn 
45 

Lys Thr Cys 
60 

Val Tyr Pro 

Val Thr He 

His Pro His 
110 

Ser Asp Ala 
125 

Arg Met Asp 
14 0 

Glu Thr Cys 
Leu Pro Cys 



Pro Leu Ala 
190 



Glu Pro 
Val Gin 
He Asp 



Glu Leu 
80 

Gin Asn 
95 

Phe Val 



Leu Leu 



Val Cys 



Ser Glu 
160 

Gly He 
175 

Glu Glu 
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Ser Asp Asn Val 
195 



Trp Trp Gly Gly 
210 

Val Val Glu Val 
225 

Glu Ala Asp Asp 



Glu Ala Glu Glu 
260 

Ala Thr Thr Thr 
275 

Val Pro Thr Thr 
290 

Glu Thr Pro Gly 
3 05 

Glu Arg Leu Glu 



Glu Trp Glu Glu 
340 

Lys Lys Ala Val 
355 

Gin Glu Ala Ala 
370 

Arg Val Glu Ala 
385 

Tyr lie Thr Ala 



Asn Met Leu Lys 
420 

Thr Leu Lys His 
435 

Ala Gin lie Arg 
450 

Arg Met Asn Gin 
465 

Glu Glu lie Gin 



Tyr Ser Asp Asp 
500 



Asp Ser 

Ala Asp 

Ala Glu 
230 

Asp Glu 
245 

Pro Tyr 

Thr Thr 

Ala Ala 

Asp Glu 
310 

Ala Lys 
325 

Ala Glu 

lie Gin 

Asn Glu 

Met Leu 
390 

Leu Gin 
405 

Lys Tyr 

Phe Glu 

Ser Gin 

Ser Leu 
470 

Asp Glu 
485 

Val Leu 



Ala Asp 
200 

Thr Asp 
215 

Glu Glu 

Asp Asp 

Glu Glu 

Thr Thr 
280 

Ser Thr 
295 

Asn Glu 

His Arg 

Arg Gin 

His Phe 
360 

Arg Gin 
375 

Asn Asp 

Ala Val 

Val Arg 

His Val 
440 

Val Met 
455 

Ser Leu 
Val Asp 

Ala Asn 



- 15 
Ala Glu 

Tyr Ala 

Glu Val 

Glu Asp 
250 

Ala Thr 
265 

Glu Ser 

Pro Asp 

His Ala 

Glu Arg 
330 

Ala Lys 
345 

Gin Glu 

Gin Leu 

Arg Arg 

Pro Pro 
410 

Ala Glu 
42 5 

Arg Met 
Thr His 



Leu Tyr 



Glu Leu 
490 



Met lie 
505 



Glu Asp 

Asp Gly 
220 

Ala Glu 
235 

Gly Asp 

Glu Arg 

Val Glu 

Ala Val 
300 

His Phe 
315 

Met Ser 

Asn Leu 

Lys Val 

Val Glu 
380 

Arg Leu 
395 

Arg Pro 

Gin Lys 

Val Asp 

Leu Arg 
460 

Asn Val 
475 

Leu Gin 
Ser Glu 



Asp Ser 
205 

Ser Glu 

Val Glu 

Glu Val 

Thr Thr 
270 

Glu Val 
285 

Asp Lys 

Gin Lys 

Gin Val 

Pro Lys 
350 

Glu Ser 
365 

Thr His 

Ala Leu 

Arg His 

Asp Arg 
430 

Pro Lys 
445 

Val lie 
Pro Ala 
Lys Glu 

Pro Arg 
510 



Asp Val 

Asp Lys 

Glu Glu 
240 

Glu Glu 
255 

Ser lie 

Val Arg 

Tyr Leu 

Ala Lys 
320 

Met Arg 

335 

Ala Asp 
Leu Glu 



Met Ala 

Glu Asn 
400 

Val Phe 
415 

Gin His 

Lys Ala 

Tyr Glu 

Val Ala 
480 

Gin Asn 
495 

lie Ser 
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Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 
515 520 525 

Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 
530 535 540 

Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 
545 550 555 560 

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 
565 570 575 

Thr Arg Pro Gly Ser Gly Leu Thr Asn lie Lys Thr Glu Glu lie Ser 
580 585 590 

Glu Val Asn Leu Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 
595 600 605 

His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 
610 615 620 

Gly Ala lie He Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 
625 630 635 640 

He Val He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 
645 650 " 655 

His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 
660 665 670 

His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 
675 680 685 

Phe Phe Glu Gin Met Gin Asn 
690 695 



<210> 13 

<211> 2088 

<212> DNA 

<213> Homo sapiens 



<400> 13 

atgctgcccg 

cccactgatg 

ctgaacatgc 

acctgcattg 

cagatcacca 

ggccgcaagc 

gagtttgtaa 

atggatgttt 

aagagtacca 

ggggtagagt 

gcggaggagg 

agtgaagaca 

gaagccgatg 

ccctacgaag 

gagtctgtgg 

gacaagtatc 

gagaggcttg 

gcagaacgtc 

caggagaaag 

acacacatgg 



gtttggcact 
gtaatgctgg 
acatgaatgt 
ataccaagga 
atgtggtaga 
agtgcaagac 
gtgatgccct 
gcgaaactca 
acttgcatga 
ttgtgtgttg 
atgactcgga 
aagtagtaga 
atgacgagga 
aagccacaga 
aagaggtggt 
tcgagacacc 
aggccaagca 
aagcaaagaa 
tggaatcttt 
ccagagtgga 



gctcctgctg 
cctgctggct 
ccagaatggg 
aggcatcctg 
agccaaccaa 
ccatccccac 
tctcgttcct 
tcttcactgg 
ctacggcatg 
cccactggct 
tgtctggtgg 
agtagcagag 
cgatgaggat 
gagaaccacc 
tcgagttcct 
tggggatgag 
ccgagagaga 
cttgcctaaa 
ggaacaggaa 
agccatgctc 



gccgcctgga 
gaaccccaga 
aagtgggatt 
cagtattgcc 
ccagtgacca 
tttgtgattc 
gacaagtgca 
cacaccgtcg 
ttgctgccct 
gaagaaagtg 

99 c 99 a 9 ca 9 
gaggaagaag 
ggtgatgagg 
agcattgcca 
acaacagcag 
aatgaacatg 
atgtcccagg 
gctgataaga 
gcagccaacg 
aatgaccgcc 



cggctcgggc 
ttgccatgtt 
cagatccatc 
aagaagtcta 
tccagaactg 
cctaccgctg 
aattcttaca 
ccaaagagac 
gcggaattga 
acaatgtgga 
acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
ccagtacccc 
cccatttcca 
tcatgagaga 
aggcagttat 
agagacagca 
gccgcctggc 



gctggaggta 
ctgtggcaga 
agggaccaaa 
ccctgaactg 
gtgcaagcgg 
cttagttggt 
ccaggagagg 
atgcagtgag 
caagttccga 
ttctgctgat 
tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
tgatgccgtt 
gaaagccaaa 
atgggaagag 
ccagcatttc 
gctggtggag 
cctggagaac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 
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tacatcaccg 
aagtatgtcc 
cgcatggtgg 
gtgatttatg 
gaggagattc 
gtcttggcca 
tctttgaccg 
gacgatctcc 
gaagttgagc 
tctgggttga 
cgacatgact 
ggttcaaaca 
atcttcatca 
gtggaggttg 
ggctacgaaa 



ctctgcaggc 
gcgcagaaca 
atcccaagaa 
agcgcatgaa 
aggatgaagt 
acatgattag 
aaacgaaaac 
agccgtggca 
ctgttgatgc 
caaatatcaa 
caggatatga 
aaggtgcaat 
ccttggtgat 
acgccgctgt 
atccaaccta 



tgttcctcct 
gaaggacaga 
agccgctcag 
tcagtctctc 
tgatgagctg 
tgaaccaagg 
caccgtggag 
ttcttttggg 
ccgccctgct 
gacggaggag 
agttcatcat 
cattggactc 
gctgaagaag 
caccccagag 
caagttcttt 



- 17 - 

cggcctcgtc 
cagcacaccc 
atccggtccc 
tccctgctct 
cttcagaaag 
atcagttacg 
ctccttcccg 
gctgactctg 
gccgaccgag 
atctctgaag 
caaaaattgg 
atggtgggcg 
aaacagtaca 
gagcgccacc 
gagcagatgc 



acgtgt tcaa 
taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 
gaaacgatgc 
tgaatggaga 
tgccagccaa 
gactgaccac 
tgaagatgga 
tgttctttgc 
gtgttgtcat 
catccattca 
tgtccaagat 
agaactag 



tatgctaaag 
cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 
tctcatgcca 
gttcagcctg 
cacagaaaac 
tcgaccaggt 
tgcagaattc 
agaagatgtg 
agcgacagtg 
tcatggtgtg 
gcagcagaac 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2088 



<210> 14 
<211> 695 
<212> PRT 

<213> Homo sapiens 
<400> 14 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys lie Asp 
50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 90 95 



Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 110 



He Pro Tyr Arg Cys Leu Val Gly 
115 120 

Val Pro Asp Lys Cys Lys Phe Leu 
130 135 

Glu Thr His Leu His Trp His Thr 
145 150 

Lys Ser Thr Asn Leu His Asp Tyr 
165 

Asp Lys Phe Arg Gly Val Glu Phe 
180 



Glu Phe Val Ser Asp Ala Leu Leu 
125 

His Gin Glu Arg Met Asp Val Cys 
140 

Val Ala Lys Glu Thr Cys Ser Glu 
155 160 

Gly Met Leu Leu Pro Cys Gly He 
170 175 

Val Cys Cys Pro Leu Ala Glu Glu 
185 190 



Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 
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Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 295 300 

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 
325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 

Lys Lys Ala Val He Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 380 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 



Tyr He Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 
405 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 

Ala Gin He Arg Ser Gin Val Met Thr His Leu Arg Val He Tyr Glu 
450 455 460 

Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu He Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
485 490 495 

Tyr Ser Asp Asp Val Leu Ala Asn Met He Ser Glu Pro Arg He Ser 
500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 
515 520 525 

Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 
530 ' 535 540 
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Pro Trp 
545 



His Ser 



Glu Val Glu Pro 



Thr Arg 
Glu Val 



His His 
610 

Gly Ala 
625 



Pro Gly 
580 

Lys Met 
595 

Gin Lys 
lie lie 



lie Phe lie Thr 
His His 



His Leu 



Gly Val 
660 

Ser Lys 
675 



Ala Asp 
Ala Arg 
Ser Gly Leu Thr 



Phe Gly 
550 

Val Asp 
565 



- 19- 
Ser Val 



Pro Ala 
570 

Asn lie 
585 



Asp Ala 
Leu Val 



Gly Leu 
630 

Leu Val 
645 



Glu Phe 
600 



Arg His 
Ala Glu 
Met Val Gly Gly 



Phe Phe 
615 



Pro Ala 
555 

Ala Asp 
Lys Thr 
Asp Ser 



Asp val 
620 

Val Val 
635 



Met Leu 



Val Glu Val Asp 

Met Gin Gin Asn 
680 



Lys Gin 
Val Thr 
Gly Tyr Glu Asn 



Lys Lys 
65 0 

Ala Ala 
665 



Asn Thr Glu Asn 
560 

Arg Gly Leu Thr 
575 

Glu Glu lie Ser 
590 

Gly Tyr Glu val 
605 

Gly Ser Asn Lys 



He Ala Thr Val 
640 

Tyr Thr Ser He 
655 

Pro Glu Glu Arg 
670 

Pro Thr Tyr Lys 
685 



Phe Phe Glu Gin Met Gin Asn 
690 695 



<210> 15 
<211> 2094 
<212> DNA 

<213> Homo sapiens 



<400> 15 

atgctgcccg 

cccactgatg 

ctgaacatgc 

acctgcattg 

cagatcacca 

ggccgcaagc 

gagtttgtaa 

atggatgttt 

aagagtacca 

ggggtagagt 

gcggaggagg 

agtgaagaca 

gaagccgatg 

ccctacgaag 

gagtctgtgg 

gacaagtatc 

gagaggcttg 

gcagaacgtc 

caggagaaag 

acacacatgg 

tacatcaccg 

aagtatgtcc 

cgcatggtgg 

gtgatttatg 

gaggagattc 



gtttggcact 
gtaatgctgg 
acatgaatgt 
ataccaagga 
atgtggtaga 
agtgcaagac 
gtgatgccct 
gcgaaactca 
acttgcatga 
ttgtgtgttg 
atgactcgga 
aagtagtaga 
atgacgagga 
aagccacaga 
aagaggtggt 
tcgagacacc 
aggccaagca 
aagcaaagaa 
tggaatcttt 
ccagagtgga 
ctctgcaggc 
gcgcagaaca 
atcccaagaa 
agcgcatgaa 
aggatgaagt 



gctcctgctg 
cctgctggct 
ccagaatggg 
aggcatcctg 
agccaaccaa 
ccatccccac 
tctcgttcct 
tcttcactgg 
ctacggcatg 
cccactggct 
tgtctggtgg 
agtagcagag 
cgatgaggat 
gagaaccacc 
tcgagttcct 
tggggatgag 
ccgagagaga 
ct tgcctaaa 
ggaacaggaa 
agccatgctc 
tgttcctcct 
gaaggacaga 
agccgctcag 
tcagtctctc 
tgatgagctg 



gccgcctgga 
gaaccccaga 
aagtgggatt 
cagtattgcc 
ccagtgacca 
tttgtgattc 
gacaagtgca 
cacaccgtcg 
ttgctgccct 
gaagaaagtg 
ggcggagcag 
gaggaagaag 
ggtgatgagg 
agcattgcca 
acaacagcag 
aatgaacatg 
atgtcccagg 
gctgataaga 
gcagccaacg 
aatgaccgcc 
cggcctcgtc 
cagcacaccc 
atccggtccc 
tccctgctct 
cttcagaaag 



cggctcgggc 
ttgccatgtt 
cagatccatc 
aagaagtcta 
tccagaactg 
cctaccgctg 
aattcttaca 
ccaaagagac 
gcggaattga 
acaatgtgga 
acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
ccagtacccc 
cccatttcca 
tea tgagaga 
aggcagttat 
agagacagca 
gccgcctggc 
acgtgttcaa 
taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 



gctggaggta 
ctgtggcaga 
agggaccaaa 
ccctgaactg 
gtgeaagegg 
cttagttggt 
ccaggagagg 
atgcagtgag 
caagttccga 
ttctgetgat 
tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
tgatgccgtt 
gaaagccaaa 
atgggaagag 
ccagcatttc 
gctggtggag 
cctggagaac 
tatgetaaag 
cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 



60 
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240 

300 

360 
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480 

540 
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gtcttggcca acatgattag tgaaccaagg atcagttacg gaaacgatgc tctcatgcca 1560 

tctttgaccg aaacgaaaac caccgtggag ctccttcccg tgaatggaga gttcagcctg X620 

gacgatctcc agccgtggca ttcttttggg gctgactctg tgccagccaa cacagaaaac 1680 

gaagttgagc ctgttgatgc ccgccctgct gccgaccgag gactgaccac tcgaccaggt 1740 

tctgggttga caaatatcaa gacggaggag atctctgaag tgaagatgga tgcagaattc 1800 

cgacatgact caggatatga agttcatcat caaaaattgg tgttctttgc agaagatgtg 1860 

ggttcaaaca aaggtgcaat cattggactc atggtgggcg gtgttgtcat agcgacagtg 1920 

atcgtcatca ccttggtgat gctgaagaag aaacagtaca catccattca tcatggtgtg 1980 

gtggaggttg acgccgctgt caccccagag gagcgccacc tgtccaagat gcagcagaac 2040 

ggctacgaaa atccaaccta caagttcttt gagcagatgc agaacaagaa gtag 2094 

<210> 16 
<211> 697 
<212> PRT 

<213> Homo sapiens 
<400> 16 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys lie Asp 
50 * 55 60 

Thr Lys Glu Gly lie Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin lie Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr lie Gin Asn 
85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 ~ 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 ~ 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 
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Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 295 300 



Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 
325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 

Lys Lys Ala Val lie Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 380 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 

Tyr lie Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 
405 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 

Ala Gin lie Arg Ser Gin Val Met Thr His Leu Arg Val lie Tyr Glu 
450 455 460 

Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu lie Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
485 490 495 

Tyr Ser Asp Asp Val Leu Ala Asn Met lie Ser Glu Pro Arg lie Ser 
500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 
515 520 525 

Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 
530 535 540 

Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 
545 550 555 560 

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 
565 570 575 
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Thr Arg Pro Gly Ser Gly Leu Thr Asn lie Lys Thr Glu Glu lie Ser 
580 585 * 590 



Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 
595 600 605 

His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 
610 615 620 

Gly Ala lie He Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 
625 630 635 640 

He Val He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 
645 650 655 

His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 
660 665 670 

His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 
675 680 685 

Phe Phe Glu Gin Met Gin Asn Lys Lys 
690 6 95 



<210> 17 

<211> 2094 

<212> DNA 

<213> Homo sapiens 



<400> 17 

atgctgcccg 

cccactgatg 

ctgaacatgc 

acctgcattg 

cagatcacca 

ggccgcaagc 

gagtttgtaa 

atggatgttt 

aagagtacca 

ggggtagagt 

gcggaggagg 

agtgaagaca 

gaagccgatg 

ccctacgaag 

gagtctgtgg 

gacaagtatc 

gagaggcttg 

gcagaacgtc 

caggagaaag 

acacacatgg 

tacatcaccg 

aagtatgtcc 

cgcatggtgg 

gtgatttatg 

gaggagattc 

gtcttggcca 

tctttgaccg 

gacgatctcc 

gaagttgagc 

tctgggttga 

cgacatgact 



gtttggcact 
gtaatgctgg 
acatgaatgt 
ataccaagga 
atgtggtaga 
agtgcaagac 
gtgatgccct 
gcgaaactca 
acttgcatga 
ttgtgtgttg 
atgactcgga 
aagtagtaga 
atgacgagga 
aagccacaga 
aagaggtggt 
tcgagacacc 
aggccaagca 
aagcaaagaa 
tggaatcttt 
ccagagtgga 
ctctgcaggc 
gcgcagaaca 
atcccaagaa 
agcgcatgaa 
aggatgaagt 
acatgattag 
aaacgaaaac 
agccgtggca 
ctgttgatgc 
caaatatcaa 
caggatatga 



gctcctgctg 
cctgctggct 
ccagaatggg 
aggcatcctg 
agccaaccaa 
ccatccccac 
tctcgttcct 
tcttcactgg 
ctacggcatg 
cccactggct 
tgtctggtgg 
agtagcagag 
cgatgaggat 
gagaaccacc 
tcgagttcct 
tggggatgag 
ccgagagaga 
cttgcctaaa 
ggaacaggaa 
agccatgctc 
tgttcctcct 
gaaggacaga 
agccgctcag 
tcagtctctc 
tgatgagctg 
tgaaccaagg 
caccgtggag 
ttcttttggg 
ccgccctgct 
gacggaggag 
agttcatcat 



gccgcctgga 
gaaccccaga 
aagtgggatt 
cagtattgcc 
ccagtgacca 
tttgtgattc 
gacaagtgca 
cacaccgtcg 
ttgctgccct 
gaagaaagtg 

ggcggagcag 

gaggaagaag 
ggtgatgagg 
agcattgcca 
acaacagcag 
aatgaacatg 
atgtcccagg 
gctgataaga 
gcagccaacg 
aatgaccgcc 
cggcctcgtc 
cagcacaccc 
atccggtccc 
tccctgctct 
cttcagaaag 
atcagttacg 
ctccttcccg 
gctgactctg 
gccgaccgag 
atctctgaag 
caaaaattgg 



cggctcgggc 
ttgccatgtt 
cagatccatc 
aagaagtcta 
tccagaactg 
cctaccgctg 
aattcttaca 
ccaaagagac 
gcggaattga 
acaatgtgga 
acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
ccagtacccc 
cccatttcca 
tcatgagaga 
aggcagttat 
agagacagca 
gccgcctggc 
acgtgttcaa 
taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 
gaaacgatgc 
tgaatggaga 
tgccagccaa 
gactgaccac 
tgaatctgga 
tgttctttgc 



gctggaggta 
ctgtggcaga 
agggaccaaa 
ccctgaactg 
gtgcaagcgg 
cttagttggt 
ccaggagagg 
atgcagtgag 
caagttccga 
ttctgctgat 
tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
tgatgccgtt 
gaaagccaaa 
atgggaagag 
ccagcatttc 
gctggtggag 
cctggagaac 
tatgctaaag 
cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 
tctcatgcca 
gttcagcctg 
cacagaaaac 
tcgaccaggt 
tgcagaattc 
agaagatgtg 
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ggttcaaaca aaggtgcaat cattggactc atggtgggcg gtgttgtcat agcgacagtg 1920 

atcgtcatca ccttggtgat gctgaagaag aaacagtaca catccattca tcatggtgtg 1980 

gtggaggttg acgccgctgt caccccagag gagcgccacc tgtccaagat gcagcagaac 2040 

ggctacgaaa atccaaccta caagttcttt gagcagatgc agaacaagaa gtag 2094 

<210> 18 
<211> 697 
<212> PRT 

<213> Homo sapiens 
<400> 18 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
1 5 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys lie Asp 
50 55 60 

Thr Lys Glu Gly lie Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
13 0 13 5 14 0 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 



Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 

225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 

245 * " 250 255 



Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 
260 265 270 
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Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 295 300 

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 
325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 

Lys Lys Ala Val lie Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 380 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 

Tyr lie Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 
405 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 

Ala Gin lie Arg Ser Gin Val Met Thr His Leu Arg Val lie Tyr Glu 
450 455 460 

Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu He Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
485 490 495 



Tyr Ser Asp Asp Val Leu Ala Asn Met He Ser Glu Pro Arg He Ser 
500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 
515 520 525 

Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 
530 535 540 

Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 
545 550 555 560 

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 
565 570 575 

Thr Arg Pro Gly Ser Gly Leu Thr Asn He Lys Thr Glu Glu He Ser 
580 585 590 

Glu Val Asn Leu Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 
595 600 605 



WO 01/23533 



PCT/US00/26080 



-25 - 

His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 
610 615 620 

Gly Ala lie lie Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 
625 630 635 640 

He Val He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 
645 650 655 

His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 
660 665 670 

His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 
675 680 685 

Phe Phe Glu Gin Met Gin Asn Lys Lys 
690 695 



<210> 19 
<211> 2094 
<212> DNA 

<213> Homo sapiens 



<400> 19 

atgctgcccg 

cccactgatg 

ctgaacatgc 

acctgcattg 

cagatcacca 

ggccgcaagc 

gagtttgtaa 

atggatgttt 

aagagtacca 

ggggtagagt 

gcggaggagg 

agtgaagaca 

gaagccgatg 

ccctacgaag 

gagtctgtgg 

gacaagtatc 

gagaggcttg 

gcagaacgtc 

caggagaaag 

acacacatgg 

tacatcaccg 

aagtatgtcc 

cgcatggtgg 

gtgatttatg 

gaggagattc 

gtcttggcca 

tctttgaccg 

gacgatctcc 

gaagttgagc 

tctgggttga 

cgacatgact 

ggt tcaaaca 

atcttcatca 

gtggaggttg 

ggctacgaaa 



gtttggcact 
gtaatgctgg 
acatgaatgt 
ataccaagga 
atgtggtaga 
agtgcaagac 
gtgatgccct 
gcgaaactca 
acttgcatga 
ttgtgtgttg 
atgactcgga 
aagtagtaga 
atgacgagga 
aagccacaga 
aagaggtggt 
tcgagacacc 
aggccaagca 
aagcaaagaa 
tggaatcttt 
ccagagtgga 
ctctgcaggc 
gcgcagaaca 
atcccaagaa 
agcgcatgaa 
aggatgaagt 
acatgattag 
aaacgaaaac 
agccgtggca 
ctgttgatgc 
caaatatcaa 
caggatatga 
aaggtgcaat 
ccttggtgat 
acgccgctgt 
atccaaccta 



gctcctgctg 
cctgctggct 
ccagaatggg 
aggcatcctg 
agccaaccaa 
ccatccccac 
tctcgttcct 
tcttcactgg 
ctacggcatg 
cccactggct 
tgtctggtgg 
agtagcagag 
cgatgaggat 
gagaaccacc 
tcgagttcct 
tggggatgag 
ccgagagaga 
cttgcctaaa 
ggaacaggaa 
agccatgctc 
tgttcctcct 
gaaggacaga 
agccgctcag 
tcagtctctc 
tgatgagctg 
tgaaccaagg 
caccgtggag 
ttcttttggg 
ccgccctgct 
gacggaggag 
agttcatcat 
cattggactc 
gctgaagaag 
caccccagag 
caagttcttt 



gccgcctgga 
gaaccccaga 
aagtgggatt 
cagtattgcc 
ccagtgacca 
tttgtgattc 
gacaagtgca 
cacaccgtcg 
ttgctgccct 
gaagaaagtg 
ggcggagcag 
gaggaagaag 
ggtgatgagg 
agcattgcca 
acaacagcag 
aatgaacatg 
atgtcccagg 
gctgataaga 
gcagccaacg 
aatgaccgcc 
cggcctcgtc 
cagcacaccc 
atccggtccc 
tccctgctct 
cttcagaaag 
atcagttacg 
ctccttcccg 
gctgactctg 
gccgaccgag 
atctctgaag 
caaaaattgg 
atggtgggcg 
aaacagtaca 
gagcgccacc 
gagcagatgc 



cggctcgggc 
ttgccatgtt 
cagatccatc 
aagaagtcta 
tccagaactg 
cctaccgctg 
aattcttaca 
ccaaagagac 
gcggaattga 
acaatgtgga 
acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
ccagtacccc 
cccatttcca 
tcatgagaga 
aggcagttat 
agagacagca 
gccgcctggc 
acgtgttcaa 
taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 
gaaacgatgc 
tgaatggaga 
tgccagccaa 
gactgaccac 
tgaagatgga 
tgttctttgc 
gtgttgtcat 
catccattca 
tgtccaagat 
agaacaagaa 



gctggaggta 
ctgtggcaga 
agggaccaaa 
ccctgaactg 
gtgcaagcgg 
cttagttggt 
ccaggagagg 
atgcagtgag 
caagttccga 
ttctgctgat 
tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
tgatgccgtt 
gaaagccaaa 
atgggaagag 
ccagcatttc 
gctggtggag 
cctggagaac 
tatgctaaag 
cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 
tctcatgcca 
gttcagcctg 
cacagaaaac 
tcgaccaggt 
tgcagaattc 
agaagatgtg 
agcgacagtg 
tcatggtgtg 
gcagcagaac 
gtag 
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<210> 20 
<211> 697 
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<212> PRT 

<213> Homo sapi< 

<400> 20 
Met Leu Pro Gly 
1 

Ala Leu Glu Val 
20 

Gin He Ala Met 
35 

Asn Gly Lys Trp 
50 

Thr Lys Glu Gly 
65 

Gin He Thr Asn 



Trp Cys Lys Arg 
100 



ns 



Leu Ala Leu Leu 
5 

Pro Thr Asp Gly 



Phe Cys Gly Arg 
40 

Asp Ser Asp Pro 
55 

He Leu Gin Tyr 
70 

Val Val Glu Ala 
85 

Gly Arg Lys Gin 



-26- 



Leu Leu Ala Ala 
10 

Asn Ala Gly Leu 
25 

Leu Asn Met His 



Ser Gly Thr Lys 
60 

Cys Gin Glu Val 
75 

Asn Gin Pro Val 
90 

Cys Lys Thr His 
105 



Trp Thr Ala Arg 

15 

Leu Ala Glu Pro 
30 

Met Asn Val Gin 
45 

Thr Cys He Asp 



Tyr Pro Glu Leu 
80 

Thr He Gin Asn 
95 

Pro His Phe Val 
110 



He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 
165 170 * 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 295 300 
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Glu Thr Pro Gly 
305 

Glu Arg Leu Glu 



Glu Trp Glu Glu 
340 

Lys Lys Ala Val 
355 

Gin Glu Ala Ala 
370 

Arg Val Glu Ala 
385 



Tyr lie Thr Ala 



Asn Met Leu Lys 
420 

Thr Leu Lys His 
435 

Ala Gin lie Arg 
450 

Arg Met Asn Gin 
465 

Glu Glu lie Gin 



Tyr Ser Asp Asp 
500 

Tyr Gly Asn Asp 
515 

Val Glu Leu Leu 
530 

Pro Trp His Ser 
545 

Glu Val Glu Pro 



Thr Arg Pro Gly 
580 

Glu Val Lys Met 
595 

His His Gin Lys 
610 

Gly Ala lie He 
625 



Asp Glu 
310 

Ala Lys 
325 

Ala Glu 

He Gin 

Asn Glu 

Met Leu 
390 

Leu Gin 
405 

Lys Tyr 
Phe Glu 
Ser Gin 



Ser Leu 
470 

Asp Glu 
485 

Val Leu 

Ala Leu 

Pro Val 

Phe Gly 
550 

Val Asp 
565 

Ser Gly 

Asp Ala 

Leu Val 

Gly Leu 
630 



Asn Glu 

His Arg 

Arg Gin 

His Phe 
360 

Arg Gin 
375 

Asn Asp 

Ala Val 

Val Arg 

His Val 
440 

Val Met 
455 

Ser Leu 

Val Asp 

Ala Asn 

Met Pro 
520 

Asn Gly 
53 5 

Ala Asp 

Ala Arg 

Leu Thr 

Glu Phe 
600 

Phe Phe 
615 

Met Val 



-27 
His Ala 

Glu Arg 
330 

Ala Lys 
345 

Gin Glu 



Gin Leu 



Arg Arg 

Pro Pro 
410 

Ala Glu 
425 

Arg Met 

Thr His 

Leu Tyr 

Glu Leu 
490 

Met He 
505 

Ser Leu 

Glu Phe 

Ser Val 

Pro Ala 
570 

Asn He 
585 

Arg His 
Ala Glu 
Gly Gly 



His Phe 
315 

Met Ser 

Asn Leu 

Lys Val 

Val Glu 
380 

Arg Leu 
395 

Arg Pro 

Gin Lys 

Val Asp 

Leu Arg 
460 

Asn Val 
475 

Leu Gin 
Ser Glu 



Thr Glu 

Ser Leu 
540 

Pro Ala 
555 

Ala Asp 

Lys Thr 

Asp Ser 

Asp Val 
620 

val Val 
635 



Gin Lys 

Gin Val 

Pro Lys 
350 

Glu Ser 
365 

Thr His 
Ala Leu 

Arg His 

Asp Arg 
430 

Pro Lys 
445 

Val He 

Pro Ala 

Lys Glu 

Pro Arg 
510 

Thr Lys 
525 

Asp Asp 

Asn Thr 

Arg Gly 

Glu Glu 
590 

Gly Tyr 
605 

Gly Ser 
He Ala 



Ala Lys 
320 

Met Arg 
335 

Ala Asp 

Leu Glu 

Met Ala 

Glu Asn 
400 

Val Phe 
415 

Gin His 

Lys Ala 

Tyr Glu 

Val Ala 
480 

Gin Asn 
495 

He Ser 
Thr Thr 



Leu Gin 

Glu Asn 
560 

Leu Thr 
575 

He Ser 

Glu Val 

Asn Lys 

Thr Val 
640 
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He Phe He Thr 



His His Gly Val 
660 

His Leu Ser Lys 
675 



Leu Val Met Leu 
645 

Val Glu Val Asp 



Met Gin Gin Asn 
680 



-28- 

Lys Lys Lys Gin 
650 

Ala Ala Val Thr 
665 

Gly Tyr Glu Asn 



Tyr Thr Ser He 
655 

Pro Glu Glu Arg 
670 

Pro Thr Tyr Lys 
685 



Phe Phe Glu Gin Met Gin Asn Lys Lys 
690 695 



<210> 21 

<211> 1341 

<212> DNA 

<213> Homo sapiens 



<400> 21 

atggctagca 

ctgcccctgc 

gacgaagagc 

aggggcaagt 

ctcaacatcc 

ttcctgcatc 

gtgtatgtgc 

atcccccatg 

aagttcttca 

gccaggcctg 

cccaacctct 

ctggcctctg 

agtctctggt 

gagatcaatg 

gtggacagtg 

tccatcaagg 

ctggtgtgct 

ctaatgggtg 

cggccagtgg 

tcatccacgg 

cgggcccgaa 

acggcagcgg 

ccacagacag 



tgactggtgg 
gcagcggcct 
ccgaggagcc 
cggggcaggg 
tggtggatac 
gctactacca 
cctacaccca 
gccccaacgt 
tcaacggctc 
acgactccct 
tctccctgca 
tcggagggag 
atacacccat 
gacaggatct 
gcaccaccaa 
cagcctcctc 
ggcaagcagg 
aggttaccaa 
aagatgtggc 
gcactgttat 
aacgaattgg 
tggaaggccc 
atgagtcatg 



acagcaaatg 
ggggggcgcc 
cggccggagg 
ctactacgtg 
aggcagcagt 
gaggcagctg 
gggcaagtgg 
cactgtgcgt 
caactgggaa 
ggagcctttc 
cctttgtggt 
catgatcatt 
ccggcgggag 
gaaaatggac 
ccttcgtttg 
cacggagaag 
caccacccct 
ccagtccttc 
cacgtcccaa 
gggagctgtt 
ctttgctgtc 
ttttgtcacc 
a 



ggtcgcggat 
cccctggggc 
ggcagctttg 
gagatgaccg 
aactttgcag 
tccagcacat 
gaaggggagc 
gccaacattg 
ggcatcctgg 
tttgactctc 
gctggcttcc 
ggaggtatcg 
tggtattatg 
tgcaaggagt 
cccaagaaag 
ttccctgatg 
tggaacattt 
cgcatcacca 
gacgactgtt 
atcatggagg 
agcgcttgcc 
ttggacatgg 



ccacccagca 
tgcggctgcc 
tggagatggt 
tgggcagccc 
tgggtgctgc 
accgggacct 
tgggcaccga 
ctgccatcac 
ggctggccta 
tggtaaagca 
ccctcaacca 
accactcgct 
aggtcatcat 
acaactatga 
tgtttgaagc 
gtttctggct 
tcccagtcat 
tccttccgca 
acaagtttgc 
gcttctacgt 
atgtgcacga 
aagactgtgg 



cggcatccgg 
ccgggagacc 
ggacaacctg 
cccgcagacg 
cccccacccc 
ccggaagggt 
cctggtaagc 
tgaatcagac 
tgctgagatt 
gacccacgtt 
gtctgaagtg 
gtacacaggc 
tgtgcgggtg 
caagagcatt 
tgcagtcaaa 
aggagagcag 
ctcactctac 
gcaatacctg 
catctcacag 
tgtctttgat 
tgagttcagg 
ctacaacatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1341 



<210> 22 
<211> 446 
<212> PRT 

<213> Homo sapiens 
<400> 22 

Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg Gly Ser Thr Gin 
15 10 15 

His Gly He Arg Leu Pro Leu Arg Ser Gly Leu Gly Gly Ala Pro Leu 
20 25 30 



Gly Leu Arg Leu Pro Arg Glu Thr Asp Glu Glu Pro Glu Glu Pro Gly 
35 40 45 



Arg Arg Gly Ser Phe Val Glu Met Val Asp Asn Leu Arg Gly Lys Ser 
.50 55 60 
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Gly Gin Gly Tyr 
65 

Leu Asn lie Leu 



Ala Pro His Pro 
100 

Thr Tyr Arg Asp 
115 

Lys Trp Glu Gly 
130 

Pro Asn Val Thr 
145 

Lys Phe Phe lie 



Tyr Ala Glu lie 
180 

Ser Leu Val Lys 
195 

Cys Gly Ala Gly 
210 

Gly Gly Ser Met 
225 

Ser Leu Trp Tyr 



lie Val Arg Val 
260 

Glu Tyr Asn Tyr 
275 

Arg Leu Pro- Lys 
290 

Ala Ser Ser Thr 
305 

Leu Val Cys Trp 



lie Ser Leu Tyr 
340 

Thr lie Leu Pro 
355 



Ser Gin Asp Asp 
370 

Thr Val Met Gly 
385 



Tyr Val 
70 

Val Asp 
85 

Phe Leu 



Leu Arg 



Glu Leu 

Val Arg 
150 

Asn Gly 
165 

Ala Arg 

Gin Thr 

Phe Pro 

He He 
230 

Thr Pro 
245 

Glu He 

Asp Lys 

Lys Val 

Glu Lys 
310 

Gin Ala 
325 

Leu Met 
Gin Gin 

Cys Tyr 

Ala Val 
390 



Glu Met 

Thr Gly 

His Arg 

Lys Gly 
120 

Gly Thr 
135 

Ala Asn 

Ser Asn 

Pro Asp 

His Val 
200 

Leu Asn 
215 

Gly Gly 

He Arg 

Asn Gly 

Ser He 
280 

Phe Glu 
295 

Phe Pro 

Gly Thr 

Gly Glu 

Tyr Leu 
360 

Lys Phe 
375 

He Met 
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Thr Val 

Ser Ser 
90 

Tyr Tyr 
105 

Val Tyr 

Asp Leu 

He Ala 

Trp Glu 
170 

Asp Ser 
185 

Pro Asn 



Gin Ser 



He Asp 

Arg Glu 
250 

Gin Asp 
265 

Val Asp 

Ala Ala 

Asp Gly 

Thr Pro 
330 

Val Thr 
345 

Arg Pro 



Ala He 



Glu Gly 



Gly Ser 
75 

Asn Phe 

Gin Arg 

Val Pro 

Val Ser 
140 

Ala He 
155 

Gly He 

Leu Glu 

Leu Phe 

Glu Val 
220 

His Ser 
235 

Trp Tyr 

Leu Lys 

Ser Gly 

Val Lys 
300 

Phe Trp 
315 

Trp Asn 
Asn Gin 
Val Glu 

Ser Gin 
380 

Phe Tyr 
395 



Pro Pro 

Ala Val 

Gin Leu 
110 

Tyr Thr 
125 

He Pro 

Thr Glu 

Leu Gly 

Pro Phe 
190 

Ser Leu 
205 

Leu Ala 



Leu Tyr 



Tyr Glu 

Met Asp 
270 

Thr Thr 
285 

Ser He 

Leu Gly 

He Phe 

Ser Phe 
350 

Asp Val 
365 

Ser Ser 
Val Val 



Gin Thr 
80 

Gly Ala 
95 

Ser Ser 

Gin Gly 

His Gly 

Ser Asp 
160 

Leu Ala 
175 

Phe Asp 

His Leu 

Ser Val 

Thr Gly 
240 

Val He 
255 

Cys Lys 
Asn Leu 



Lys Ala 

Glu Gin 
320 

Pro Val 
335 

Arg He 
Ala Thr 

Thr Gly 

Phe Asp 
400 
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Arg Ala Arg Lys Arg lie Gly Phe 
405 

Asp Glu Phe Arg Thr Ala Ala Val 
420 

Met Glu Asp Cys Gly Tyr Asn lie 

435 440 



-30- 

Ala Val Ser Ala Cys His Val His 
410 415 

Glu Gly Pro Phe Val Thr Leu Asp 
425 430 

Pro Gin Thr Asp Glu Ser 
445 



<210> 23 

<211> 1380 

<212> DNA 

<213> Homo sapiens 



<400> 23 

atggctagca 

ccgcgtgaac 

gggggcgccc 

ggccggaggg 
tactacgtgg 
ggcagcagta 
aggcagctgt 
ggcaagtggg 
actgtgcgtg 
aactgggaag 
gagcctttct 
ctttgtggtg 
atgatcattg 
cggcgggagt 
aaaatggact 
cttcgtttgc 
acggagaagt 
accacccctt 
cagtccttcc 
acgtcccaag 
ggagctgtta 
tttgctgtca 
tttgtcacct 



tgactggtgg 
aggacggatc 
ccctggggct 
gcagctttgt 
agatgaccgt 
actttgcagt 
ccagcacata 
aaggggagct 
ccaacattgc 
gcatcctggg 
ttgactctct 
ctggcttccc 
gaggtatcga 
ggtattatga 
gcaaggagta 
ccaagaaagt 
tccctgatgg 
ggaacatttt 
gcatcaccat 
acgactgtta 
tcatggaggg 
gcgcttgcca 
tggacatgga 



acagcaaatg 
cacccagcac 
gcggctgccc 
ggagatggtg 
gggcagcccc 
gggtgctgcc 
ccgggacctc 
gggcaccgac 
tgccatcact 
gctggcctat 
ggtaaagcag 
cctcaaccag 
ccactcgctg 
ggtcatcatt 
caactatgac 
gtttgaagct 
tttctggcta 
cccagtcatc 
ccttccgcag 
caagtttgcc 
cttctacgtt 
tgtgcacgat 
agactgtggc 



ggtcgcggat 
ggcatccggc 
cgggagaccg 
gacaacctga 
ccgcagacgc 
ccccacccct 
cggaagggtg 
ctggtaagca 
gaatcagaca 
gctgagattg 
acccacgttc 
tctgaagtgc 
tacacaggca 
gtgcgggtgg 
aagagcattg 
gcagtcaaat 
ggagagcagc 
tcactctacc 
caatacctgc 
atctcacagt 
gtctttgatc 
gagttcagga 
tacaacattc 



<210> 24 
<211> 459 
<212> PRT 

<213> Homo sapiens 
<400> 24 

Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly 
15 10 



cgatgactat 
tgcccctgcg 
acgaagagcc 
ggggcaagtc 
tcaacatcct 
tcctgcatcg 
tgtatgtgcc 
tcccccatgg 
agttcttcat 
ccaggcctga 
ccaacctctt 
tggcctctgt 
gtctctggta 
agatcaatgg 
tggacagtgg 
ccatcaaggc 
tggtgtgctg 
taatgggtga 
ggccagtgga 
catccacggg 
gggcccgaaa 
cggcagcggt 
cacagacaga 



ctctgactct 
cagcggcctg 
cgaggagccc 
ggggcagggc 

ggtggataca 

ctactaccag 
ctacacccag 
ccccaacgtc 
caacggctcc 
cgactccctg 
ctccctgcac 
cggagggagc 
tacacccatc 
acaggatctg 
caccaccaac 
agcctcctcc 
gcaagcaggc 
ggttaccaac 
agatgtggcc 
cactgttatg 
acgaattggc 
ggaaggccct 
tgagtcatga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 



Arg Gly Ser Met Thr 
15 



lie Ser Asp Ser Pro Arg Glu Gin Asp Gly Ser Thr Gin His Gly lie 
20 25 30 

Arg Leu Pro Leu Arg Ser Gly Leu Gly Gly Ala Pro Leu Gly Leu Arg 
35 40 45 

Leu Pro Arg Glu Thr Asp Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly 
50 55 6 0 

Ser Phe Val Glu Met Val Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly 
65 70 75 80 



Tyr Tyr Val Glu Met Thr Val Gly Ser Pro Pro Gin Thr Leu Asn lie 
85 90 95 
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Leu Val Asp Thr Gly Ser Ser Asn Phe Ala Val Gly Ala Ala Pro His 
100 105 110 

Pro Phe Leu His Arg Tyr Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg 
115 120 125 

Asp Leu Arg Lys Gly Val Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu 
130 " " 135 140 

Gly Glu Leu Gly Thr Asp Leu Val Ser lie Pro His Gly Pro Asn Val 
145 150 155 160 

Thr Val Arg Ala Asn He Ala Ala He Thr Glu Ser Asp Lys Phe Phe 
165 170 175 

He Asn Gly Ser Asn Trp Glu Gly He Leu Gly Leu Ala Tyr Ala Glu 
180 185 190 

He Ala Arg Pro Asp Asp Ser Leu Glu Pro Phe Phe Asp Ser Leu Val 
195 200 205 

Lys Gin Thr His Val Pro Asn Leu Phe Ser Leu His Leu Cys Gly Ala 
210 215 220 

Gly Phe Pro Leu Asn Gin Ser Glu Val Leu Ala Ser Val Gly Gly Ser 
225 230 235 240 

Met He He Gly Gly He Asp His Ser Leu Tyr Thr Gly Ser Leu Trp 
245 250 255 

Tyr Thr Pro He Arg Arg Glu Trp Tyr Tyr Glu Val He He Val Arg 
260 265 270 

Val Glu He Asn Gly Gin Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn 
275 280 285 

Tyr Asp Lys Ser He Val Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro 
290 295 300 



Lys Lys Val Phe Glu 
305 

Thr Glu Lys Phe Pro 
325 

Trp Gin Ala Gly Thr 
340 

Tyr Leu Met Gly Glu 
355 

Pro Gin Gin Tyr Leu 
370 

Asp Cys Tyr Lys Phe 
385 

Gly Ala Val He Met 
405 

Lys Arg lie Gly Phe 
420 



Ala Ala Val Lys Ser 
310 

Asp Gly Phe Trp Leu 
330 

Thr Pro Trp Asn He 
345 

Val Thr Asn Gin Ser 
360 

Arg Pro Val Glu Asp 
375 

Ala He Ser Gin Ser 
390 

Glu Gly Phe Tyr Val 
410 

Ala Val Ser Ala Cys 
425 



He Lys Ala Ala Ser Ser 
315 320 

Gly Glu Gin Leu Val Cys 
335 

Phe Pro Val He Ser Leu 
350 

Phe Arg He Thr He Leu 
365 

Val Ala Thr Ser Gin Asp 
380 

Ser Thr Gly Thr Val Met 
395 400 

Val Phe Asp Arg Ala Arg 
415 

His Val His Asp Glu Phe 
430 
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Arg Thr Ala Ala Val Glu Gly Pro Phe Val Thr Leu Asp Met Glu Asp 
435 440 445 

Cys Gly Tyr Asn lie Pro Gin Thr Asp Glu Ser 
450 455 



<210> 25 

<211> 1302 

<212> DNA 

<213> Homo sapiens 



<400> 25 

atgactcagc 

ctgcgtctgc 

gtggagatgg 

gtgggcagcc 

gtgggtgctg 

taccgggacc 

ctgggcaccg 

gctgccatca 

gggctggcct 

ctggtaaagc 

cccctcaacc 

gaccactcgc 

gaggtcatca 

tacaactatg 

gtgtttgaag 

ggtttctggc 

ttcccagtca 

atccttccgc 

tacaagtttg 

ggcttctacg 

catgtgcacg 

gaagactgtg 



atggtattcg 
cccgggagac 
tggacaacct 
ccccgcagac 
ccccccaccc 
tccggaaggg 
acctggtaag 
ctgaatcaga 
atgctgagat 
agacccacgt 
agtctgaagt 
tgtacacagg 
ttgtgcgggt 
acaagagcat 
ctgcagtcaa 
taggagagca 
tctcactcta 
agcaatacct 
ccatctcaca 
ttgtctttga 
atgagttcag 
gctacaacat 



tctgccactg 
cgacgaagag 
gaggggcaag 
gctcaacatc 
cttcctgcat 
tgtgtatgtg 
catcccccat 
caagttcttc 
tgccaggcct 
tcccaacctc 
gctggcctct 
cagtctctgg 
ggagatcaat 
tgtggacagt 
atccatcaag 
gctggtgtgc 
cctaatgggt 
gcggccagtg 
gtcatccacg 
tcgggcccga 
gacggcagcg 
tccacagaca 



cgtagcggtc 
cccgaggagc 
tcggggcagg 
ctggtggata 
cgctactacc 
ccctacaccc 
ggccccaacg 
atcaacggct 
gacgactccc 
ttctccctgc 
gtcggaggga 
tatacaccca 
ggacaggatc 
ggcaccacca 
gcagcctcct 
tggcaagcag 
gaggttacca 
gaagatgtgg 
ggcactgtta 
aaacgaattg 
gtggaaggcc 
gatgagtcat 



tgggtggtgc 
ccggccggag 
gctactacgt 
caggcagcag 
agaggcagct 
agggcaagtg 
tcactgtgcg 
ccaactggga 
tggagccttt 
acctttgtgg 
gcatgatcat 
tccggcggga 
tgaaaatgga 
accttcgttt 
ccacggagaa 
gcaccacccc 
accagtcctt 
ccacgtccca 
tgggagctgt 
gctttgctgt 
cttttgtcac 
ga 



tccactgggt 
gggcagcttt 
ggagatgacc 
taactttgca 
gtccagcaca 
ggaaggggag 
tgccaacatt 
aggcatcctg 
ctttgactct 
tgctggcttc 
tggaggtatc 
gtggtattat 
ctgcaaggag 
gcccaagaaa 
gttccctgat 
ttggaacatt 
ccgcatcacc 
agacgactgt 
tatcatggag 
cagcgcttgc 
cttggacatg 



<210> 26 

<211> 433 

<212> PRT 

<213> Homo sapiens 

<400> 26 

Met Thr Gin His Gly lie Arg Leu Pro Leu Arg 
15 10 

Ala Pro Leu Gly Leu Arg Leu Pro Arg Glu Thr 
20 25 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1302 



Ser Gly Leu Gly Gly 
15 

Asp Glu Glu Pro Glu 
30 



Glu Pro Gly Arg Arg Gly Ser Phe Val Glu Met Val Asp Asn Leu Arg 
3 5 4 0 45 

Gly Lys Ser Gly Gin Gly Tyr Tyr Val Glu Met Thr Val Gly Ser Pro 
50 55 60 



Pro Gin Thr Leu Asn lie Leu Val Asp Thr Gly Ser Ser Asn Phe Ala 
65 70 75 80 

Val Gly Ala Ala Pro His Pro Phe Leu His Arg Tyr Tyr Gin Arg Gin 
85 90 95 



Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val Tyr Val Pro Tyr 
100 105 110 
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Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp Leu Val Ser lie 
115 120 125 

Pro His Gly Pro Asn Val Thr Val Arg Ala Asn lie Ala Ala He Thr 
130 135 140 

Glu Ser Asp Lys Phe Phe He Asn Gly Ser Asn Trp Glu Gly He Leu 
145 150 155 160 

Gly Leu Ala Tyr Ala Glu He Ala Arg Pro Asp Asp Ser Leu Glu Pro 
165 170 175 

Phe Phe Asp Ser Leu Val Lys Gin Thr His Val Pro Asn Leu Phe Ser 
180 185 190 

Leu His Leu Cys Gly Ala Gly Phe Pro Leu Asn Gin Ser Glu Val Leu 
195 200 205 

Ala Ser Val Gly Gly Ser Met He He Gly Gly He Asp His Ser Leu 
210 215 220 

Tyr Thr Gly Ser Leu Trp Tyr Thr Pro He Arg Arg Glu Trp Tyr Tyr 
225 230 235 240 

Glu Val He He Val Arg Val Glu He Asn Gly Gin Asp Leu Lys Met 
245 250 255 

Asp Cys Lys Glu Tyr Asn Tyr Asp Lys Ser He Val Asp Ser Gly Thr 
260 " 265 270 

Thr Asn Leu Arg Leu Pro Lys Lys Val Phe Glu Ala Ala Val Lys Ser 
275 280 285 

He Lys Ala Ala Ser Ser Thr Glu Lys Phe Pro Asp Gly Phe Trp Leu 
290 295 300 

Gly Glu Gin Leu Val Cys Trp Gin Ala Gly Thr Thr Pro Trp Asn He 
305 310 315 320 

Phe Pro Val lie Ser Leu Tyr Leu Met Gly Glu Val Thr Asn Gin Ser 
325 330 335 

Phe Arg He Thr lie Leu Pro Gin Gin Tyr Leu Arg Pro Val Glu Asp 
340 345 350 

Val Ala Thr Ser Gin Asp Asp Cys Tyr Lys Phe Ala He Ser Gin Ser 
355 360 365 

Ser Thr Gly Thr Val Met Gly Ala Val He Met Glu Gly Phe Tyr Val 
370 375 380 

Val Phe Asp Arg Ala Arg Lys Arg He Gly Phe Ala Val Ser Ala Cys 
385 390 395 400 

His Val His Asp Glu Phe Arg Thr Ala Ala Val Glu Gly Pro Phe Val 
405 410 415 

Thr Leu Asp Met Glu Asp Cys Gly Tyr Asn He Pro Gin Thr Asp Glu 
420 425 430 

Ser 
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<210> 27 
<211> 1278 
<212> DNA 

<213> Homo sapiens 



<400> 27 

atggctagca 

ccgctggact 

ggcaagtcgg 

aacatcctgg 

ctgcatcgct 

tatgtgccct 

ccccatggcc 

ttcttcatca 

aggcctgacg 

aacctcttct 

gcctctgtcg 

ctctggtata 

atcaatggac 

gacagtggca 

atcaaggcag 

gtgtgctggc 

atgggtgagg 

ccagtggaag 

tccacgggca 

gcccgaaaac 

gcagcggtgg 

cagacagatg 



tgactggtgg 
ctggtatcga 
ggcagggcta 
tggatacagg 
actaccagag 
acacccaggg 
ccaacgtcac 
acggctccaa 
actccctgga 
ccctgcacct 
gagggagcat 
cacccatccg 
aggatctgaa 
ccaccaacct 
cctcctccac 
aagcaggcac 
ttaccaacca 
atgtggccac 
ctgttatggg 
gaattggctt 
aaggcccttt 
agtcatga 



acagcaaatg 
aaccgacgga 
ctacgtggag 
cagcagtaac 
gcagctgtcc 
caagtgggaa 
tgtgcgtgcc 
ctgggaaggc 
gcctttcttt 
ttgtggtgct 
gatcattgga 
gcgggagtgg 
aatggactgc 
tcgtttgccc 
ggagaagttc 
caccccttgg 
gtccttccgc 
gtcccaagac 
agctgttatc 
tgctgtcagc 
tgtcaccttg 



ggtcgcggat 
tcctttgtgg 
atgaccgtgg 
tttgcagtgg 
agcacatacc 
ggggagctgg 
aacattgctg 
atcctggggc 
gactctctgg 
ggcttccccc 
ggtatcgacc 
tattatgagg 
aaggagtaca 
aagaaagtgt 
cctgatggtt 
aacattttcc 
atcaccatcc 
gactgttaca 
atggagggct 
gcttgccatg 
gacatggaag 



cgatgactat 
agatggtgga 
gcagcccccc 
gtgctgcccc 
gggacctccg 
gcaccgacct 
ccatcactga 
tggcctatgc 
taaagcagac 
tcaaccagtc 
actcgctgta 
tcatcattgt 
actatgacaa 
ttgaagctgc 
tctggctagg 
cagtcatctc 
ttccgcagca 
agtttgccat 
tctacgttgt 
tgcacgatga 
actgtggcta 



ctctgactct 
caacctgagg 
gcagacgctc 
ccaccccttc 
gaagggtgtg 
ggtaagcatc 
atcagacaag 
tgagattgcc 
ccacgttccc 
tgaagtgctg 
cacaggcagt 
gcgggtggag 
gagcattgtg 
agtcaaatcc 
agagcagctg 
actctaccta 
atacctgcgg 
ctcacagtca 
ctttgatcgg 
gttcaggacg 
caacattcca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1278 



<210> 28 
<211> 425 
<212> PRT 

<213> Homo sapiens 
<400> 28 

Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg Gly Ser Met Thr 
15 10 15 

lie Ser Asp Ser Pro Leu Asp Ser Gly lie Glu Thr Asp Gly Ser Phe 
20 25 30 

Val Glu Met Val Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly Tyr Tyr 
35 40 45 

Val Glu Met Thr Val Gly Ser Pro Pro Gin Thr Leu Asn He Leu Val 
50 55 60 

Asp Thr Gly Ser Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe 
65 70 75 80 

Leu His Arg Tyr Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu 
85 90 95 

Arg Lys Gly Val Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu 
100 105 110 

Leu Gly Thr Asp Leu Val Ser He Pro His Gly Pro Asn Val Thr Val 
115 120 125 



Arg Ala Asn He Ala Ala He Thr Glu Ser Asp Lys Phe Phe He Asn 
130 135 140 
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Gly Ser Asn Trp Glu Gly He Leu Gly Leu Ala Tyr Ala Glu He Ala 
145 150 155 160 

Arg Pro Asp Asp Ser Leu Glu Pro Phe Phe Asp Ser Leu Val Lys Gin 
165 170 175 

Thr His Val Pro Asn Leu Phe Ser Leu His Leu Cys Gly Ala Gly Phe 
180 185 190 

Pro Leu Asn Gin Ser Glu Val Leu Ala Ser Val Gly Gly Ser Met He 
195 200 205 

He Gly Gly He Asp His Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr 
210 215 220 

Pro He Arg Arg Glu Trp Tyr Tyr Glu Val He He Val Arg Val Glu 
225 " 230 235 240 

He Asn Gly Gin Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp 
245 250 255 

Lys Ser He Val Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys 
260 265 270 

Val Phe Glu Ala Ala Val Lys Ser He Lys Ala Ala Ser Ser Thr Glu 
275 280 285 

Lys Phe Pro Asp Gly Phe Trp Leu Gly Glu Gin Leu Val Cys Trp Gin 
290 295 300 

Ala Gly Thr Thr Pro Trp Asn He Phe Pro Val He Ser Leu Tyr Leu 
305 310 315 320 

Met Gly Glu Val Thr Asn Gin Ser Phe Arg He Thr He Leu Pro Gin 
325 330 335 

Gin Tyr Leu Arg Pro Val Glu Asp Val Ala Thr Ser Gin Asp Asp Cys 
340 345 350 

Tyr Lys Phe Ala He Ser Gin Ser Ser Thr Gly Thr Val Met Gly Ala 
355 360 365 

Val He Met Glu Gly Phe Tyr Val Val Phe Asp Arg Ala Arg Lys Arg 
370 375 380 

He Gly Phe Ala Val Ser Ala Cys His Val His Asp Glu Phe Arg Thr 
385 390 395 400 

Ala Ala Val Glu Gly Pro Phe Val Thr Leu Asp Met Glu Asp Cys Gly 
405 410 415 

Tyr Asn He Pro Gin Thr Asp Glu Ser 
420 425 



<210> 29 
<211> 1362 
<212> DNA 

<213> Homo sapiens 
<400> 29 

atggcccaag ccctgccctg gctcctgctg tggatgggcg cgggagtgct gcctgcccac 60 
ggcacccagc acggcatccg gctgcccctg cgcagcggcc tggggggcgc ccccctgggg 120 
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ctgcggctgc 
gtggagatgg 
gtgggcagcc 
gtgggtgctg 
taccgggacc 
ctgggcaccg 
gctgccatca 
gggctggcct 
ctggtaaagc 
cccctcaacc 
gaccactcgc 
gaggtcatca 
tacaactatg 
gtgtttgaag 
ggt ttctggc 
ttcccagtca 
atccttccgc 
tacaagtttg 
ggcttctacg 
catgtgcacg 
gaagactgtg 



cccgggagac 
tggacaacct 
ccccgcagac 
ccccccaccc 
tccggaaggg 
acctggtaag 
ctgaatcaga 
atgctgagat 
agacccacgt 
agtctgaagt 
tgtacacagg 
ttgtgcgggt 
acaagagcat 
ctgcagtcaa 
taggagagca 
tctcactcta 
agcaatacct 
ccatctcaca 
ttgtctttga 
atgagttcag 
gctacaacat 



cgacgaagag 
gaggggcaag 
gctcaacatc 
cttcctgcat 
tgtgtatgtg 
catcccccat 
caagttcttc 
tgccaggcct 
tcccaacctc 
gctggcctct 
cagtctctgg 
ggagatcaat 
tgtggacagt 
atccatcaag 
gctggtgtgc 
cctaatgggt 
gcggccagtg 
gtcatccacg 
tcgggcccga 
gacggcagcg 
tccacagaca 



cccgaggagc 
tcggggcagg 
ctggtggata 
cgctactacc 
ccctacaccc 
ggccccaacg 
atcaacggct 
gacgactccc 
ttctccctgc 
gtcggaggga 
tatacaccca 
ggacaggatc 
ggcaccacca 
gcagcctcct 
tggcaagcag 
gaggttacca 
gaagatgtgg 
ggcactgtta 
aaacgaattg 
gtggaaggcc 
gatgagtcat 



ccggccggag 
gctactacgt 
caggcagcag 
agaggcagct 
agggcaagtg 
tcactgtgcg 
ccaactggga 
tggagccttt 
acctttgtgg 
gcatgatcat 
tccggcggga 
tgaaaatgga 
accttcgttt 
ccacggagaa 
gcaccacccc 
accagtcctt 
ccacgtccca 
tgggagctgt 
gctttgctgt 
cttttgtcac 
ga 



gggcagcttt 
ggagatgacc 
taactttgca 
gtccagcaca 
ggaaggggag 
tgccaacatt 
aggcatcctg 
ctttgactct 
tgctggcttc 
tggaggtatc 
gtggtattat 
ctgcaaggag 
gcccaagaaa 
gttccctgat 
ttggaacatt 
ccgcatcacc 
agacgactgt 
tatcatggag 
cagcgcttgc 
cttggacatg 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1362 



<210> 30 

<211> 453 

<212> PRT 

<213> Homo sapiens 

<400> 30 

Met Ala Gin Ala Leu Pro Trp Leu Leu Leu Trp 
1 5 10 



Met Gly Ala Gly Val 
15 



Leu Pro 



Ala His 
20 



Gly Thr Gin His 



Gly lie Arg Leu 
2 5 



Pro Leu Arg Ser 
3 0 



Gly Leu 



Gly Gly 
35 



Ala Pro Leu Gly 
40 



Leu Arg Leu Pro 



Arg Glu Thr Asp 
45 



Glu Glu Pro Glu Glu Pro Gly Arg 
50 55 



Arg Gly Ser Phe 
60 



Val Glu Met Val 



Asp Asn Leu Arg Gly Lys Ser Gly 
65 70 



Gin Gly Tyr Tyr 
75 



Val Glu Met Thr 
80 



Val Gly Ser Pro 



Pro Gin Thr Leu 
85 



Asn lie Leu Val 
90 



Asp Thr Gly Ser 
95 



Ser Asn 



Phe Ala 
100 



Val Gly Ala Ala 



Pro His Pro Phe 
105 



Leu His Arg Tyr 
110 



Tyr Gin 



Arg Gin 
115 



Leu Ser Ser Thr 
120 



Tyr Arg Asp Leu 



Arg Lys Gly Val 
125 



Tyr Val Pro Tyr Thr Gin Gly Lys 
130 135 



Trp Glu Gly Glu 
140 



Leu Gly Thr Asp 



Leu Val Ser lie Pro 
145 

Ala Ala lie Thr Glu 
165 



His Gly Pro Asn Val 
150 

Ser Asp Lys Phe Phe 
170 



Thr Val Arg Ala Asn He 

155 160 

He Asn Gly Ser Asn Trp 
175 
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Glu Gly lie Leu Gly Leu Ala Tyr Ala Glu lie Ala Arg Pro Asp Asp 
180 185 190 

Ser Leu Glu Pro Phe Phe Asp Ser Leu Val Lys Gin Thr His Val Pro 
195 200 205 

Asn Leu Phe Ser Leu Gin Leu Cys Gly Ala Gly Phe Pro Leu Asn Gin 
210 215 220 

Ser Glu Val Leu Ala Ser Val Gly Gly Ser Met lie He Gly Gly He 
225 230 235 240 

Asp His Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr Pro He Arg Arg 
245 250 255 

Glu Trp Tyr Tyr Glu Val He He Val Arg Val Glu He Asn Gly Gin 
260 265 270 

Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp Lys Ser He Val 
275 280 285 

Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys Val Phe Glu Ala 
290 295 300 

Ala Val Lys Ser He Lys Ala Ala Ser Ser Thr Glu Lys Phe Pro Asp 
305 310 315 320 

Gly Phe Trp Leu Gly Glu Gin Leu Val Cys Trp Gin Ala Gly Thr Thr 
325 330 335 

Pro Trp Asn He Phe Pro Val He Ser Leu Tyr Leu Met Gly Glu Val 
340 345 350 

Thr Asn Gin Ser Phe Arg He Thr He Leu Pro Gin Gin Tyr Leu Arg 
355 360 365 

Pro Val Glu Asp Val Ala Thr Ser Gin Asp Asp Cys Tyr Lys Phe Ala 
370 375 380 

He Ser Gin Ser Ser Thr Gly Thr Val Met Gly Ala Val He Met Glu 
385 390 395 400 

Gly Phe Tyr Val Val Phe Asp Arg Ala Arg Lys Arg He Gly Phe Ala 
405 410 415 

Val Ser Ala Cys His Val His Asp Glu Phe Arg Thr Ala Ala Val Glu 
420 425 430 



Gly Pro Phe Val Thr Leu Asp Met Glu Asp Cys Gly Tyr Asn He Pro 
435 440 445 

Gin Thr Asp Glu Ser 
450 



<210> 31 
<211> 1380 
<212> DNA 

<213> Homo sapiens 
<400> 31 

atggcccaag ccctgccctg gctcctgctg tggatgggcg cgggagtgct gcctgcccac 60 
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ggcacccagc acggcatccg gctgcccctg cgcagcggcc tggggggcgc ccccctgggg 120 
ctgcggctgc cccgggagac cgacgaagag cccgaggagc ccggccggag gggcagcttt 180 
gtggagatgg tggacaacct gaggggcaag tcggggcagg gctactacgt ggagatgacc 240 
gtgggcagcc ccccgcagac gctcaacatc ctggtggata caggcagcag taactttgca 300 
gtgggtgctg ccccccaccc cttcctgcat cgctactacc agaggcagct gtccagcaca 360 
taccgggacc tccggaaggg tgtgtatgtg ccctacaccc agggcaagtg ggaaggggag 420 
ctgggcaccg acctggtaag catcccccat ggccccaacg tcactgtgcg tgccaacatt 480 
gctgccatca ctgaatcaga caagttcttc atcaacggct ccaactggga aggcatcctg 540 
gggctggcct atgctgagat tgccaggcct gacgactccc tggagccttt ctttgactct 600 
ctggtaaagc agacccacgt tcccaacctc ttctccctgc acctttgtgg tgctggcttc 660 
cccctcaacc agtctgaagt gctggcctct gtcggaggga gcatgatcat tggaggtatc 720 
gaccactcgc tgtacacagg cagtctctgg tatacaccca tccggcggga gtggtattat 780 
gaggtcatca ttgtgcgggt ggagatcaat ggacaggatc tgaaaatgga ctgcaaggag 840 
tacaactatg acaagagcat tgtggacagt ggcaccacca accttcgttt gcccaagaaa 900 
gtgtttgaag ctgcagtcaa atccatcaag gcagcctcct ccacggagaa gttccctgat 960 
ggtttctggc taggagagca gctggtgtgc tggcaagcag gcaccacccc ttggaacatt 102 0 
ttcccagtca tctcactcta cctaatgggt gaggttacca accagtcctt ccgcatcacc 1080 
atccttccgc agcaatacct gcggccagtg gaagatgtgg ccacgtccca agacgactgt 1140 
tacaagtttg ccatctcaca gtcatccacg ggcactgtta tgggagctgt tatcatggag 1200 
ggcttctacg ttgtctttga tcgggcccga aaacgaattg gctttgctgt cagcgcttgc 1260 
catgtgcacg atgagttcag gacggcagcg gtggaaggcc cttttgtcac cttggacatg 1320 
gaagactgtg gctacaacat tccacagaca gatgagtcac agcagcagca gcagcagtga 1380 

<210> 32 
<211> 459 
<212> PRT 

<213> Homo sapiens 
<400> 32 

Met Ala Gin Ala Leu Pro Trp Leu Leu Leu Trp Met Gly Ala Gly Val 
15 10 15 

Leu Pro Ala His Gly Thr Gin His Gly lie Arg Leu Pro Leu Arg Ser 
20 25 30 

Gly Leu Gly Gly Ala Pro Leu Gly Leu Arg Leu Pro Arg Glu Thr Asp 
35 40 45 

Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly Ser Phe Val Glu Met Val 
50 55 60 

Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly Tyr Tyr Val Glu Met Thr 
65 70 75 80 

Val Gly Ser Pro Pro Gin Thr Leu Asn lie Leu Val Asp Thr Gly Ser 
85 90 95 

Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe Leu His Arg Tyr 
100 105 110 

Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val 
115 120 125 

Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp 
130 135 ^ 140 

Leu Val Ser lie Pro His Gly Pro Asn Val Thr Val Arg Ala Asn lie 
145 150 155 160 



Ala Ala lie Thr Glu Ser Asp Lys Phe Phe lie Asn Gly Ser Asn Trp 
165 170 175 
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Glu Gly He Leu Gly Leu Ala Tyr Ala Glu He Ala Arg Pro Asp Asp 
180 185 190 

Ser Leu Glu Pro Phe Phe Asp Ser Leu Val Lys Gin Thr His Val Pro 
195 200 205 

Asn Leu Phe Ser Leu Gin Leu Cys Gly Ala Gly Phe Pro Leu Asn Gin 
210 215 220 

Ser Glu Val Leu Ala Ser Val Gly Gly Ser Met He He Gly Gly He 
225 230 235 240 

Asp His Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr Pro He Arg Arg 
245 250 255 

Glu Trp Tyr Tyr Glu Val He He Val Arg Val Glu He Asn Gly Gin 
260 265 270 

Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp Lys Ser He Val 
275 280 285 

Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys Val Phe Glu Ala 
290 295 300 

Ala Val Lys Ser He Lys Ala Ala Ser Ser Thr Glu Lys Phe Pro Asp 
305 310 315 320 

Gly Phe Trp Leu Gly Glu Gin Leu Val Cys Trp Gin Ala Gly Thr Thr 
325 330 335 

Pro Trp Asn He Phe Pro Val He Ser Leu Tyr Leu Met Gly Glu Val 
340 345 350 

Thr Asn Gin Ser Phe Arg He Thr He Leu Pro Gin- Gin Tyr Leu Arg 
355 360 365 

Pro Val Glu Asp Val Ala Thr Ser Gin Asp Asp Cys Tyr Lys Phe Ala 
370 375 380 

He Ser Gin Ser Ser Thr Gly Thr Val Met Gly Ala Val He Met Glu 
385 390 395 400 

Gly Phe Tyr Val Val Phe Asp Arg Ala Arg Lys Arg He Gly Phe Ala 
405 410 415 

Val Ser Ala Cys His Val His Asp Glu Phe Arg Thr Ala Ala Val Glu 
420 425 430 

Gly Pro Phe Val Thr Leu Asp Met Glu Asp Cys Gly Tyr Asn He Pro 
435 440 445 

Gin Thr Asp Glu Ser His His His His His His 
450 455 



<210> 33 
<211> 25 
<212> PRT 

<213> Homo sapiens 



<400> 33 
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Ser Glu Gin Gin Arg Arg Pro Arg Asp Pro Glu Val Val Asn Asp Glu 
15 10 15 

Ser Ser Leu Val Arg His Arg Trp Lys 
20 25 



<210> 34 
<211> 19 
<212> PRT 

<213> Homo sapiens 
<400> 34 

Ser Glu Gin Leu Arg Gin Gin His Asp Asp Phe Ala Asp Asp lie Ser 
15 10 15 

Leu Leu Lys 



<210> 35 

<211> 29 

<212> DNA 

<213> Homo sapiens 



<400> 35 

gtggatccac ccagcacggc atccggctg 



29 



<210> 36 
<211> 36 
<212> DNA 

<213> Homo sapiens 



<400> 36 

gaaagctttc atgactcatc tgtctgtgga atgttg 



36 



<210> 37 
<211> 39 
<212> DNA 

<213> Homo sapiens 



<400> 37 

gatcgatgac tatctctgac tctccgcgtg aacaggacg 



39 



<210> 38 

<211> 39 

<212> DNA 

<213> Homo sapiens 



<400> 38 

gatccgtcct gttcacgcgg agagtcagag atagtcatc 



39 



<210> 39 

<211> 77 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hu-Asp2 



<400> 39 

cggcatccgg ctgcccctgc gtagcggtct gggtggtgct ccactgggtc tgcgtctgcc 60 
ccgggagacc gacgaag 77 
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<210> 40 
<211> 77 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hu-Asp2 
<400> 40 

cttcgtcggt ctcccggggc agacgcagac ccagtggagc accacccaga ccgctacgca 60 
ggggcagccg gatgccg 77 

<210> 41 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase 8 
Cleavage Site 

<400> 41 

gatcgatgac tatctctgac tctccgctgg actctggtat cgaaaccgac g 51 

<210> 42 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase 8 
Cleavage Site 

<400> 42 

gatccgtcgg tttcgatacc agagtccagc ggagagtcag agatagtcat c 51 

<210> 43 
<211> 32 
<212> DNA 

<213> Homo sapiens 
<400> 43 

aaggatcctt tgtggagatg gtggacaacc tg 32 

<210> 44 
<211> 36 
<212> DNA 

<213> Homo sapiens 
<400> 44 

gaaagctttc atgactcatc tgtctgtgga atgttg 36 

<210> 45 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 6-His tag 



<400> 45 

gatcgcatca tcaccatcac catg 



24 
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<210> 46 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 6-His tag 
<400> 46 

gatccatggt gatggtgatg atgc 24 

<210> 47 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<210> 48 

<211> 51 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 48 

cgaattaaat tccagcacac tggctacttc ttgttctgca tctcaaagaa c 51 

<210> 49 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<210> 50 
<211> 1287 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hu-Asp2(b) 
delta TM 

<400> 50 

atggcccaag ccctgccctg gctcctgctg tggatgggcg cgggagtgct gcctgcccac 60 

ggcacccagc acggcatccg gctgcccctg cgcagcggcc tggggggcgc ccccctgggg 120 

ctgcggctgc cccgggagac cgacgaagag cccgaggagc ccggccggag gggcagcttt 180 

gtggagatgg tggacaacct gaggggcaag tcggggcagg gctactacgt ggagatgacc 240 

gtgggcagcc ccccgcagac gctcaacatc ctggtggata caggcagcag taactttgca 300 

gtgggtgctg ccccccaccc cttcctgcat cgctactacc agaggcagct gtccagcaca 360 

taccgggacc tccggaaggg tgtgtatgtg ccctacaccc agggcaagtg ggaaggggag 420 

ctgggcaccg acctggtaag catcccccat ggccccaacg tcactgtgcg tgccaacatt 480 

gctgccatca ctgaatcaga caagttcttc atcaacggct ccaactggga aggcatcctg 540 



<400> 47 

gactgaccac tcgaccaggt tc 



22 



<400> 49 

cgaattaaat tccagcacac tggcta 



26 
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gggctggcct 
gaagtgctgg 
acaggcagtc 
cgggtggaga 
agcattgtgg 
gtcaaatcca 
gagcagctgg 
ctctacctaa 
tacctgcggc 
tcacagtcat 
tttgatcggg 
ttcaggacgg 
aacattccac 



atgctgagat 
cctctgtcgg 
tctggtatac 
tcaatggaca 
acagtggcac 
tcaaggcagc 
tgtgctggca 
tgggtgaggt 
cagtggaaga 
ccacgggcac 
cccgaaaacg 
cagcggtgga 
agacagatga 



tgccaggctt 
agggagcatg 
acccatccgg 
ggatctgaaa 
caccaacctt 
ctcctccacg 
agcaggcacc 
taccaaccag 
tgtggccacg 
tgttatggga 
aattggcttt 
aggccctttt 
gtcatga 



-43- 

tgtggtgctg 
atcattggag 
cgggagtggt 
atggactgca 
cgtttgccca 
gagaagttcc 
accccttgga 
tccttccgca 
tcccaagacg 
gctgttatca 
gctgtcagcg 
gtcaccttgg 



gcttccccct 
gtatcgacca 
attatgaggt 
aggagtacaa 
agaaagtgtt 
ctgatggttt 
acattttccc 
tcaccatcct 
actgttacaa 
tggagggctt 
cttgccatgt 
acatggaaga 



<210> 51 

<211> 428 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hu-Asp2 (b) 
delta TM 



caaccagtc t 
ctcgctgtac 
catcat tgtg 
ctatgacaag 
tgaagctgca 
ctggctagga 
agtcatctca 
tccgcagcaa 
gt ttgccatc 
ctacgttgtc 
gcacgatgag 
ctgtggctac 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1287 



<400> 51 
Met Ala Gin 
1 

Leu Pro Ala 



Gly Leu Gly 
35 

Glu Glu Pro 
50 

Asp Asn Leu 
65 

Val Gly Ser 



Ser Asn Phe 



Tyr Gin Arg 
115 

Tyr Val Pro 
13 0 

Leu Val Ser 
145 

Ala Ala He 



Ala Leu 
5 

His Gly 
20 

Gly Ala 
Glu Glu 
Arg Gly 



Pro Pro 
85 

Ala Val 
100 



Pro Trp 
Thr Gin 
Pro Leu 



Pro Gly 
55 

Lys Ser 
70 



Leu Leu Leu Trp 
10 

His Gly He Arg 
25 

Gly Leu Arg Leu 
40 

Arg Arg Gly Ser 



Gly Gin Gly Tyr 
75 



Gin Thr Leu Asn He Leu 
90 

Gly Ala Ala Pro His Pro 
105 



Gin Leu Ser Ser 



Tyr Thr 
He Pro 



Thr Glu 
165 



Gin Gly 
135 

His Gly 
150 



Thr Tyr Arg Asp 
120 

Lys Trp Glu Gly 



Pro Asn Val Thr 
155 



Ser Asp Lys Phe Phe He 
170 



Met Gly Ala 

Leu Pro Leu 
30 

Pro Arg Glu 
45 

Phe Val Glu 
60 

Tyr Val Glu 

Val Asp Thr 

Phe Leu His 
110 

Leu Arg Lys 
125 

Glu Leu Gly 
14 0 

Val Arg Ala 
Asn Gly Ser 



Gly Val 
15 

Arg Ser 

Thr Asp 

Met Val 

Met Thr 
80 

Gly Ser 
95 

Arg Tyr 
Gly Val 
Thr Asp 



Asn He 
160 

Asn Trp 
175 



Glu Gly He Leu Gly Leu Ala Tyr 
180 

Ala Gly Phe Pro Leu Asn Gin Ser 

195 200 



Ala Glu He Ala Arg Leu Cys Gly 
185 190 

Glu Val Leu Ala Ser Val Gly Gly 
205 
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Ser Met lie lie Gly Gly lie Asp His Ser Leu Tyr Thr Gly Ser Leu 
210 215 220 

Trp Tyr Thr Pro lie Arg Arg Glu Trp Tyr Tyr Glu Val lie lie Val 
225 230 235 240 

Arg Val Glu lie Asn Gly Gin Asp Leu Lys Met Asp Cys Lys Glu Tyr 
245 250 255 

Asn Tyr Asp Lys Ser lie Val Asp Ser Gly Thr Thr Asn Leu Arg Leu 
260 265 270 

Pro Lys Lys Val Phe Glu Ala Ala Val Lys Ser lie Lys Ala Ala Ser 
275 280 285 

Ser Thr Glu Lys Phe Pro Asp Gly Phe Trp Leu Gly Glu Gin Leu Val 
290 295 300 

Cys Trp Gin Ala Gly Thr Thr Pro Trp Asn He Phe Pro Val He Ser 
305 310 315 320 

Leu Tyr Leu Met Gly Glu Val Thr Asn Gin Ser Phe Arg He Thr He 
325 330 335 

Leu Pro Gin Gin Tyr Leu Arg Pro Val Glu Asp Val Ala Thr Ser Gin 
340 345 350 

Asp Asp Cys Tyr Lys Phe Ala He Ser Gin Ser Ser Thr Gly Thr Val 
355 360 365 

Met Gly Ala Val He Met Glu Gly Phe Tyr Val Val Phe Asp Arg Ala 
370 375 380 

Arg Lys Arg He Gly Phe Ala Val Ser Ala Cys His Val His Asp Glu 
385 390 395 400 

Phe Arg Thr Ala Ala Val Glu Gly Pro Phe Val Thr Leu Asp Met Glu 
405 410 415 

Asp Cys Gly Tyr Asn He Pro Gin Thr Asp Glu Ser 
420 425 



<210> 52 

<211> 1305 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hu-Asp2 (b) 
delta TM 



<400> 52 

atggcccaag 

ggcacccagc 

ctgcggctgc 

gtggagatgg 

gtgggcagcc 

gtgggtgctg 

taccgggacc 

ctgggcaccg 

gctgccatca 

gggctggcct 



ccctgccctg 
acggcatccg 
cccgggagac 
tggacaacct 
ccccgcagac 
ccccccaccc 
tccggaaggg 
acctggtaag 
ctgaatcaga 
atgctgagat 



gctcctgctg 
gctgcccctg 
cgacgaagag 
gaggggcaag 
gctcaacatc 
cttcctgcat 
tgtgtatgtg 
catcccccat 
caagttcttc 
tgccaggctt 



tggatgggcg 
cgcagcggcc 
cccgaggagc 
tcggggcagg 
ctggtggata 
cgctactacc 
ccctacaccc 
ggccccaacg 
atcaacggct 
tgtggtgctg 



cgggagtgct 
tggggggcgc 
ccggccggag 
gctactacgt 
caggcagcag 
agaggcagct 
agggcaagtg 
tcactgtgcg 
ccaactggga 
gcttccccct 



gcctgcccac 
ccccctgggg 
gggcagcttt 
ggagatgacc 
taact ttgca 
gtccagcaca 
ggaaggggag 
tgccaacatt 
aggcatcctg 
caaccagtct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 
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gaagtgctgg 
acaggcagtc 
cgggtggaga 
agcattgtgg 
gtcaaatcca 
gagcagctgg 
ctctacctaa 
tacctgcggc 
tcacagtcat 
tttgatcggg 
ttcaggacgg 
aacattccac 



cctctgtcgg 
tctggtatac 
tcaatggaca 
acagtggcac 
tcaaggcagc 
tgtgctggca 
tgggtgaggt 
cagtggaaga 
ccacgggcac 
cccgaaaacg 
cagcggtgga 
agacagatga 



agggagcatg 
acccatccgg 
ggatctgaaa 
caccaacctt 
ctcctccacg 
agcaggcacc 
taccaaccag 
tgtggccacg 
tgttatggga 
aattggcttt 
aggccctttt 
gtcacagcag 



-45 - 

atcattggag 
cgggagtggt 
atggactgca 
cgtttgccca 
gagaagttcc 
accccttgga 
tccttccgca 
tcccaagacg 
gctgt tatca 
gctgtcagcg 
gtcaccttgg 
cagcagcagc 



gtatcgacca 
attatgaggt 
aggagtacaa 
agaaagtgtt 
ctgatggttt 
acattttccc 
tcaccatcct 
actgttacaa 
tggagggctt 
cttgccatgt 
acatggaaga 
agtga 



ctcgctgtac 
catcattgtg 
ctatgacaag 
tgaagctgca 
ctggctagga 
agtcatctca 
tccgcagcaa 
gtttgccatc 
ctacgttgtc 
gcacgatgag 
ctgtggctac 



660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1305 



<210> 53 
<211> 434 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hu-Asp2 (b) 
delta TM 



<400> 53 

Met Ala Gin Ala Leu 
1 5 

Leu Pro Ala His Gly 
20 



Pro Trp Leu Leu 
Thr Gin 



Gly Leu Gly Gly Ala Pro Leu 
35 



Glu Glu Pro Glu Glu 
50 

Asp Asn Leu Arg Gly 
65 

Val Gly Ser Pro Pro 
85 



Pro Gly 
55 

Lys Ser 
70 



His Gly 

25 

Gly Leu 
40 

Arg Arg 
Gly Gin 



Gin Thr Leu Asn 



Ser Asn Phe Ala Val Gly Ala 
100 



Ala Pro 
105 



Leu Trp 
10 

lie Arg 
Arg Leu 
Gly Ser 



Gly Tyr 
75 

lie Leu 
90 

His Pro 



Tyr Gin Arg Gin Leu 
115 

Tyr Val Pro Tyr Thr 
130 

Leu Val Ser lie Pro 
145 

Ala Ala He Thr Glu 
165 

Glu Gly He Leu Gly 
180 



Ser Ser 



Thr Tyr 
120 



Gin Gly 
135 

His Gly 
150 



Arg Asp 
Lys Trp Glu Gly 



Met Gly 

Leu Pro 

Pro Arg 
45 

Phe Val 
60 

Tyr Val 
Val Asp 
Phe Leu 



Leu Arg 
125 

Glu Leu 
140 



Pro Asn 



Val Thr 
155 



Ser Asp Lys Phe 
Leu Ala 



Ala Gly Phe Pro Leu Asn Gin 
195 



Tyr Ala 
185 

Ser Glu 
200 



Val Arg 
Asn Gly 
Glu He Ala Arg 



Phe He 
170 



Val Leu 



Ala Ser 
205 



Ala Gly Val 
15 

Leu Arg Ser 
30 

Glu Thr Asp 

Glu Met Val 

Glu Met Thr 
80 

Thr Gly Ser 
95 

His Arg Tyr 
110 

Lys Gly Val 
Gly Thr Asp 



Ala Asn He 
160 

Ser Asn Trp 
175 

Leu Cys Gly 
190 

Val Gly Gly 
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Ser Met lie lie Gly Gly lie Asp His Ser Leu Tyr Thr Gly Ser Leu 
210 ' 215 220 

Trp Tyr Thr Pro lie Arg Arg Glu Trp Tyr Tyr Glu Val lie lie Val 
225 230 235 240 

Arg Val Glu lie Asn Gly Gin Asp Leu Lys Met Asp Cys Lys Glu Tyr 
245 250 255 

Asn Tyr Asp Lys Ser lie Val Asp Ser Gly Thr Thr Asn Leu Arg Leu 
260 265 270 

Pro Lys Lys Val Phe Glu Ala Ala Val Lys Ser lie Lys Ala Ala Ser 
275 280 285 

Ser Thr Glu Lys Phe Pro Asp Gly Phe Trp Leu Gly Glu Gin Leu Val 
290 295 300 

Cys Trp Gin Ala Gly Thr Thr Pro Trp Asn lie Phe Pro Val lie Ser 
305 310 315 320 

Leu Tyr Leu Met Gly Glu Val Thr Asn Gin Ser Phe Arg lie Thr lie 
325 330 335 

Leu Pro Gin Gin Tyr Leu Arg Pro Val Glu Asp Val Ala Thr Ser Gin 
340 345 350 

Asp Asp Cys Tyr Lys Phe Ala lie Ser Gin Ser Ser Thr Gly Thr Val 
355 360 365 

Met Gly Ala Val lie Met Glu Gly Phe Tyr Val Val Phe Asp Arg Ala 
370 375 380 

Arg Lys Arg lie Gly Phe Ala Val Ser Ala Cys His Val His Asp Glu 
385 390 395 400 

Phe Arg Thr Ala Ala Val Glu Gly Pro Phe Val Thr Leu Asp Met Glu 
405 410 415 

Asp Cys Gly Tyr Asn lie Pro Gin Thr Asp Glu Ser His His His His 
420 425 430 

His His 



<210> 54 
<211> 2310 
<212> DNA 

<213> Homo sapiens 
<400> 54 

atgctgcccg gtttggcact gctcctgctg 

cccactgatg gtaatgctgg cctgctggct 

ctgaacatgc acatgaatgt ccagaatggg 

acctgcattg ataccaagga aggcatcctg 

cagatcacca atgtggtaga agccaaccaa 

ggccgcaagc agtgcaagac ccatccccac 

gagtttgtaa gtgatgccct tctcgttcct 

atggatgttt gcgaaactca tcttcactgg 

aagagtacca acttgcatga ctacggcatg 

ggggtagagt ttgtgtgttg cccactggct 



gccgcctgga cggctcgggc gctggaggta 60 

gaaccccaga ttgccatgtt ctgtggcaga 120 

aagtgggatt cagatccatc agggaccaaa 180 

cagtattgcc aagaagtcta ccctgaactg 240 

ccagtgacca tccagaactg gtgcaagcgg 300 

tttgtgattc cctaccgctg cttagttggt 360 

gacaagtgca aattcttaca ccaggagagg 420 

cacaccgtcg ccaaagagac atgcagtgag 480 

ttgctgccct gcggaattga caagttccga 540 

gaagaaagtg acaatgtgga ttctgctgat 600 
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gcggaggagg 
agtgaagaca 
gaagccgatg 
ccctacgaag 
gagtctgtgg 
cgagcaatga 
tacggcggat 
tgtggcagcg 
cctgttaaac 
acacctgggg 
aagcaccgag 
aagaacttgc 
tctttggaac 
gtggaagcca 
caggctgttc 
gaacagaagg 
aagaaagccg 
atgaatcagt 
gaagttgatg 
attagtgaac 
aaaaccaccg 
tggcattctt 
gatgcccgcc 
atcaagacgg 
tatgaagttc 
gcaatcattg 
gtgatgctga 
gctgtcaccc 
acctacaagt 



atgactcgga 
aagtagtaga 
atgacgagga 
aagccacaga 
aagaggtggt 
tctcccgctg 
gtggcggcaa 
ccatgtccca 
ttcctacaac 
atgagaatga 
agagaatgtc 
ctaaagctga 
aggaagcagc 
tgctcaatga 
ctcctcggcc 
acagacagca 
ctcagatccg 
ctctctccct 
agctgcttca 
caaggatcag 
tggagctcct 
ttggggctga 
ctgctgccga 
aggagatctc 
atcatcaaaa 
gactcatggt 
agaagaaaca 
cagaggagcg 
tctttgagca 



tgtctggtgg 
agtagcagag 
cgatgaggat 
gagaaccacc 
tcgagaggtg 
gtactttgat 
ccggaacaac 
aagtttactc 
agcagccagt 
acatgcccat 
ccaggtcatg 
taagaaggca 
caacgagaga 
ccgccgccgc 
tcgtcacgtg 
caccctaaag 
gtcccaggtt 
gctctacaac 
gaaagagcaa 
ttacggaaac 
tcccgtgaat 
ctctgtgcca 
ccgaggactg 
tgaagtgaag 
attggtgttc 
gggcggtgtt 
gtacacatcc 
ccacctgtcc 
gatgcagaac 



ggcggagcag 
gaggaagaag 
ggtgatgagg 
agcattgcca 
tgctctgaac 
gtgactgaag 
tttgacacag 
aagactaccc 
acccctgatg 
ttccagaaag 
agagaatggg 
gttatccagc 
cagcagctgg 
ctggccctgg 
ttcaatatgc 
catttcgagc 
atgacacacc 
gtgcctgcag 
aactattcag 
gatgctctca 
ggagagttca 
gccaacacag 
accactcgac 
atggatgcag 
tttgcagaag 
gtcatagcga 
attcatcatg 
aagatgcagc 



acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
aagccgagac 
ggaagtgtgc 
aagagtactg 
aggaacctct 
ccgttgacaa 
ccaaagagag 
aagaggcaga 
atttccagga 
tggagacaca 
agaactacat 
taaagaagta 
atgtgcgcat 
tccgtgtgat 
tggccgagga 
atgacgtctt 
tgccatcttt 
gcctggacga 
aaaacgaagt 
caggttctgg 
aattccgaca 
atgtgggttc 
cagtgatcgt 
gtgtggtgga 
agaacggcta 



tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
ggggccgtgc 
cccattcttt 
catggccgtg 
tggccgagat 
gtatctcgag 
gcttgaggcc 
acgtcaagca 
gaaagtggaa 
catggccaga 
caccgctctg 
tgtccgcgca 
ggtggatccc 
ttatgagcgc 
gattcaggat 
ggccaacatg 
gaccgaaacg 
tctccagccg 
tgagcctgtt 
gttgacaaat 
tgactcagga 
aaacaaaggt 
catcaccttg 
ggttgacgcc 
cgaaaatcca 



660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2310 



<210> 55 

<211> 770 

<212> PRT 

<213> Homo sapiens 



<400> 55 
Met Leu Pro 
1 



Gly Leu Ala Leu Leu Leu Leu Ala Ala 
5 10 



Trp Thr Ala Arg 
15 



Ala Leu Glu 



Val Pro Thr Asp 
2 0 



Gly Asn 
25 



Ala Gly Leu 



Leu Ala Glu Pro 
30 



Gin lie Ala 
35 



Met Phe Cys Gly 



Arg Leu 
40 



Asn Met His 



Met Asn Val Gin 
45 



Asn Gly Lys 
50 



Trp Asp Ser Asp 
55 



Pro Ser 



Gly Thr Lys 
60 



Thr Cys lie Asp 



Thr Lys Glu 
65 

Gin lie Thr 



Gly lie Leu Gin Tyr Cys 
70 

Asn Val Val Glu Ala Asn 
85 



Gin Glu Val 
75 

Gin Pro Val 
90 



Tyr Pro Glu Leu 
80 

Thr He Gin Asn 
95 



Trp Cys Lys 



Arg Gly Arg Lys 
100 



Gin Cys 
105 



Lys Thr His 



Pro His Phe Val 
110 



He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 



Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 • 135 140 
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Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg Ala Met He 
290 295 300 

Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys Ala Pro Phe Phe 
305 310 315 320 

Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Asp Thr Glu Glu Tyr 
325 330 335 

Cys Met Ala Val Cys Gly Ser Ala Met Ser Gin Ser Leu Leu Lys Thr 
340 345 350 

Thr Gin Glu Pro Leu Ala Arg Asp Pro Val Lys Leu Pro Thr Thr Ala 
355 360 365 

Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu Glu Thr Pro Gly Asp 
370 375 380 

Glu Asn Glu His Ala His Phe Gin Lys Ala Lys Glu Arg Leu Glu Ala 
385 390 395 400 

Lys His Arg Glu Arg Met Ser Gin Val Met Arg Glu Trp Glu Glu Ala 
405 410 415 

Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp Lys Lys Ala Val He 
420 425 430 

Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu Gin Glu Ala Ala Asn 
435 440 445 

Glu Arg Gin Gin Leu Val Glu Thr His Met Ala Arg Val Glu Ala Met 
450 455 460 

Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn Tyr He Thr Ala Leu 
465 *"* 470 475 480 
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Gln Ala Val Pro Pro Arg Pro Arg His Val Phe Asn Met Leu Lys Lys 
485 490 495 

Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His Thr Leu Lys His Phe 
500 505 510 

Glu His Val Arg Met Val Asp Pro Lys Lys Ala Ala Gin He Arg Ser 
515 ~ 520 525 

Gin Val Met Thr His Leu Arg Val He Tyr Glu Arg Met Asn Gin Ser 
530 535 540 

Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala Glu Glu He Gin Asp 
545 550 555 560 

Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn Tyr Ser Asp Asp Val 
565 570 575 

Leu Ala Asn Met He Ser Glu Pro Arg He Ser Tyr Gly Asn Asp Ala 
580 585 590 

Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr Val Glu Leu Leu Pro 
595 600 605 

Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin Pro Trp His Ser Phe 
610 615 620 

Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn Glu Val Glu Pro Val 
625 630 635 640 

Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr Thr Arg Pro Gly Ser 
645 650 655 

Gly Leu Thr Asn He Lys Thr Glu Glu He Ser Glu Val Lys Met Asp 
660 665 670 

Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val His His Gin Lys Leu 
675 " 680 685 

Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys Gly Ala He He Gly 
690 695 700 

Leu Met Val Gly Gly Val Val He Ala Thr Val He Val He Thr Leu 
705 710 715 720 

Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He His His Gly Val Val 
725 730 735 

Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg His Leu Ser Lys Met 
740 745 750 

Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gin Met 
755 760 765 

Gin Asn 
770 



<210> 56 

<211> 2253 

<212> DNA 

<213> Homo sapiens 
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<400> 56 

atgctgcccg gtttggcact gctcctgctg gccgcctgga cggctcgggc gctggaggta 60 
cccactgatg gtaatgctgg cctgctggct gaaccccaga ttgccatgtt ctgtggcaga 120 
ctgaacatgc acatgaatgt ccagaatggg aagtgggatt cagatccatc agggaccaaa 180 
acctgcattg ataccaagga aggcatcctg cagtattgcc aagaagtcta ccctgaactg 240 
cagatcacca atgtggtaga agccaaccaa ccagtgacca tccagaactg gtgcaagcgg 300 
ggccgcaagc agtgcaagac ccatccccac tttgtgattc cctaccgctg cttagttggt 360 
gagtttgtaa gtgatgccct tctcgttcct gacaagtgca aattcttaca ccaggagagg 420 
atggatgttt gcgaaactca tcttcactgg cacaccgtcg ccaaagagac atgcagtgag 480 
aagagtacca acttgcatga ctacggcatg ttgctgccct gcggaattga caagttccga 540 
ggggtagagt ttgtgtgttg cccactggct gaagaaagtg acaatgtgga ttctgctgat 600 
gcggaggagg atgactcgga tgtctggtgg ggcggagcag acacagacta tgcagatggg 660 
agtgaagaca aagtagtaga agtagcagag gaggaagaag tggctgaggt ggaagaagaa 72 0 
gaagccgatg atgacgagga cgatgaggat ggtgatgagg tagaggaaga ggctgaggaa 780 
ccctacgaag aagccacaga gagaaccacc agcattgcca ccaccaccac caccaccaca 840 
gagtctgtgg aagaggtggt tcgagaggtg tgctctgaac aagccgagac ggggccgtgc 900 
cgagcaatga tctcccgctg gtactttgat gtgactgaag ggaagtgtgc cccattcttt 960 
tacggcggat gtggcggcaa ccggaacaac tttgacacag aagagtactg catggccgtg 102 0 
tgtggcagcg ccattcctac aacagcagcc agtacccctg atgccgttga caagtatctc 1080 
gagacacctg gggatgagaa tgaacatgcc catttccaga aagccaaaga gaggcttgag 1140 
gccaagcacc gagagagaat gtcccaggtc atgagagaat gggaagaggc agaacgtcaa 12 0 0 
gcaaagaact tgcctaaagc tgataagaag gcagttatcc agcatttcca ggagaaagtg 1260 
gaatctttgg aacaggaagc agccaacgag agacagcagc tggtggagac acacatggcc 1320 
agagtggaag ccatgctcaa tgaccgccgc cgcctggccc tggagaacta catcaccgct 1380 
ctgcaggctg ttcctcctcg gcctcgtcac gtgttcaata tgctaaagaa gtatgtccgc 1440 
gcagaacaga aggacagaca gcacacccta aagcatttcg agcatgtgcg catggtggat 1500 
cccaagaaag ccgctcagat ccggtcccag gttatgacac acctccgtgt gatttatgag 1560 
cgcatgaatc agtctctctc cctgctctac aacgtgcctg cagtggccga ggagattcag 1620 
gatgaagttg atgagctgct tcagaaagag caaaactatt cagatgacgt cttggccaac 1680 
atgattagtg aaccaaggat cagttacgga aacgatgctc tcatgccatc tttgaccgaa 1740 
acgaaaacca ccgtggagct ccttcccgtg aatggagagt tcagcctgga cgatctccag 1800 
ccgtggcatt cttttggggc tgactctgtg ccagccaaca cagaaaacga agttgagcct 1860 
gttgatgccc gccctgctgc cgaccgagga ctgaccactc gaccaggttc tgggttgaca 1920 
aatatcaaga cggaggagat ctctgaagtg aagatggatg cagaattccg acatgactca 1980 
ggatatgaag ttcatcatca aaaattggtg ttctttgcag aagatgtggg ttcaaacaaa 2040 
ggtgcaatca ttggactcat ggtgggcggt gttgtcatag cgacagtgat cgtcatcacc 2100 
ttggtgatgc tgaagaagaa acagtacaca tccattcatc atggtgtggt ggaggttgac 2160 
gccgctgtca ccccagagga gcgccacctg tccaagatgc agcagaacgg ctacgaaaat 2220 
ccaacctaca agttctttga gcagatgcag aac 2253 

<210> 57 

<211> 751 

<212> PRT 

<213> Homo sapiens 

<400> 57 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 ~ 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys lie Asp 
50 55 60 

Thr Lys Glu Gly lie Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 
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Gin lie Thr Asn 



Trp Cys Lys Arg 
100 

lie Pro Tyr Arg 
115 

Val Pro Asp Lys 
13 0 

Glu Thr His Leu 
145 

Lys Ser Thr Asn 



Asp Lys Phe Arg 
180 

Ser Asp Asn Val 
195 

Trp Trp Gly Gly 
210 

Val Val Glu Val 
225 

Glu Ala Asp Asp 



Glu Ala Glu Glu 
260 

Ala Thr Thr Thr 
275 

Glu Val Cys Ser 
290 

Ser Arg Trp Tyr 
305 

Tyr Gly Gly Cys 



Cys Met Ala Val 
340 

Pro Asp Ala Val 
355 

His Ala His Phe 
370 

Glu Arg Met Ser 
385 

Ala Lys Asn Leu 



Val Val 
85 

Gly Arg 

Cys Leu 

Cys Lys 

His Trp 
150 

Leu His 
165 

Gly Val 

Asp Ser 

Ala Asp 

Ala Glu 
230 

Asp Glu 
245 

Pro Tyr 
Thr Thr 



Glu Gin 



Phe Asp 
310 

Gly Gly 
325 

Cys Gly 

Asp Lys 

Gin Lys 

Gin Val 
390 

Pro Lys 
4 05 



Glu Ala 

Lys Gin 

Val Gly 
120 

Phe Leu 
135 

His Thr 

Asp Tyr 

Glu Phe 

Ala Asp 
200 

Thr Asp 
215 

Glu Glu 



Asp Asp 



Glu Glu 



Thr Thr 
280 

Ala Glu 
295 

Val Thr 



Asn Arg 



Ser Ala 



Tyr Leu 
360 

Ala Lys 
375 

Met Arg 



Ala Asp 



-51 • 

Asn Gin 
90 

Cys Lys 
105 

Glu Phe 

His Gin 

Val Ala 

Gly Met 
170 

Val Cys 
185 

Ala Glu 

Tyr Ala 

Glu Val 

Glu Asp 
250 

Ala Thr 
265 

Glu Ser 

Thr Gly 

Glu Gly 

Asn Asn 
330 

lie Pro 
345 

Glu Thr 

Glu Arg 

Glu Trp 

Lys Lys 
410 



Pro Val 

Thr His 

Val Ser 

Glu Arg 
140 

Lys Glu 
155 

Leu Leu 

Cys Pro 

Glu Asp 

Asp Gly 
220 

Ala Glu 
235 

Gly Asp 

Glu Arg 

Val Glu 

Pro Cys 
300 

Lys Cys 
315 

Phe Asp 

Thr Thr 

Pro Gly 

Leu Glu 
380 

Glu Glu 
395 

Ala Val 



Thr He 



Pro His 
110 

Asp Ala 
125 

Met Asp 

Thr Cys 

Pro Cys 

Leu Ala 
190 

Asp Ser 
205 

Ser Glu 

Val Glu 

Glu Val 

Thr Thr 
270 

Glu Val 
285 

Arg Ala 
Ala Pro 
Thr Glu 

Ala Ala 
350 

Asp Glu 
365 

Ala Lys 
Ala Glu 
He Gin 



Gin Asn 
95 

Phe Val 

Leu Leu 

Val Cys 

Ser Glu 
160 

Gly He 
175 

Glu Glu 

Asp Val 

Asp Lys 

Glu Glu 
240 

Glu Glu 
255 

Ser He 

Val Arg 

Met He 

Phe Phe 
320 

Glu Tyr 
335 

Ser Thr 

Asn Glu 

His Arg 

Arg Gin 
400 

His Phe 
415 
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Gin Glu Lys Val 
420 

Gin Leu Val Glu 
435 

Arg Arg Arg Leu 
450 

Pro Pro Arg Pro 
465 

Ala Glu Gin Lys 



Arg Met Val Asp 
500 

Thr His Leu Arg 
515 

Leu Tyr Asn Val 
530 

Glu Leu Leu Gin 
545 

Met lie Ser Glu 



Ser Leu Thr Glu 
580 

Glu Phe Ser Leu 
595 

Ser Val Pro Ala 
610 



Pro Ala Ala Asp 
625 

Asn lie Lys Thr 



Arg His Asp Ser 
660 

Ala Glu Asp Val 
675 

Gly Gly Val Val 
690 

Lys Lys Lys Gin 
705 

Ala Ala Val Thr 



Gly Tyr Glu Asn 
740 



Glu Ser 

Thr His 

Ala Leu 

Arg His 
470 

Asp Arg 
485 

Pro Lys 

Val lie 

Pro Ala 

Lys Glu 
550 

Pro Arg 
5 65 

Thr Lys 
Asp Asp 
Asn Thr 

Arg Gly 
630 

Glu Glu 
645 

Gly Tyr 

Gly Ser 

He Ala 

Tyr Thr 
710 

Pro Glu 
725 

Pro Thr 



Leu Glu 

Met Ala 
440 

Glu Asn 
455 

Val Phe 

Gin His 

Lys Ala 

Tyr Glu 
520 

Val Ala 
535 

Gin Asn 

He Ser 

Thr Thr 

Leu Gin 
600 

Glu Asn 
615 

Leu Thr 

He Ser 

Glu Val 

Asn Lys 
680 

Thr Val 
695 

Ser He 
Glu Arg 
Tyr Lys 



-52- 

Gln Glu 
425 

Arg Val 

Tyr He 

Asn Met 

Thr Leu 
490 

Ala Gin 
505 

Arg Met 

Glu Glu 

Tyr Ser 

Tyr Gly 
570 

Val Glu 
585 

Pro Trp 
Glu Val 

Thr Arg 

Glu Val 
650 

His His 
665 

Gly Ala 

He Val 

His His 

His Leu 
730 

Phe Phe 
745 



Ala Ala 

Glu Ala 

Thr Ala 
460 

Leu Lys 
475 

Lys His 

lie Arg 

Asn Gin 

He Gin 
540 

Asp Asp 
555 

Asn Asp 

Leu Leu 

His Ser 

Glu Pro 
620 

Pro Gly 
635 

Lys Met 

Gin Lys 

He He 

He Thr 
700 

Gly Val 
715 

Ser Lys 
Glu Gin 



Asn Glu 
430 

Met Leu 
445 

Leu Gin 

Lys Tyr 

Phe Glu 

Ser Gin 
510 

Ser Leu 
525 

Asp Glu 

Val Leu 

Ala Leu 

Pro Val 
590 

Phe Gly 
605 

Val Asp 

Ser Gly 

Asp Ala 

Leu Val 
670 

Gly Leu 
685 

Leu Val 
Val Glu 



Met Gin 



Met Gin 
750 



Arg Gin 
Asn Asp 
Ala Val 
Val Arg 

480 

His Val 
495 

Val Met 

Ser Leu 

Val Asp 

Ala Asn 
560 

Met Pro 
575 

Asn Gly 
Ala Asp 
Ala Arg 

Leu Thr 
640 

Glu Phe 
655 

Phe Phe 

Met Val 

Met Leu 

Val Asp 
720 

Gin Asn 
735 

Asn 
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<210> 58 

<211> 2316 

<212> DNA 

<213> Homo sapiens 



<400> 58 

atgctgcccg 

cccactgatg 

ctgaacatgc 

acctgcattg 

cagatcacca 

ggccgcaagc 

gagtttgtaa 

atggatgttt 

aagagtacca 

ggggtagagt 

gcggaggagg 

agtgaagaca 

gaagccgatg 

ccctacgaag 

gagtctgtgg 

cgagcaatga 

tacggcggat 

tgtggcagcg 

cctgttaaac 

acacctgggg 

aagcaccgag 

aagaacttgc 

tctttggaac 

gtggaagcca 

caggctgttc 

gaacagaagg 

aagaaagccg 

atgaatcagt 

gaagttgatg 

attagtgaac 

aaaaccaccg 

tggcattctt 

gatgcccgcc 

atcaagacgg 

tatgaagttc 

gcaatcattg 

gtgatgctga 

gctgtcaccc 

acctacaagt 



gtttggcact 
gtaatgctgg 
acatgaatgt 
ataccaagga 
atgtggtaga 
agtgcaagac 
gtgatgccct 
gcgaaactca 
acttgcatga 
ttgtgtgttg 
atgactcgga 
aagtagtaga 
atgacgagga 
aagccacaga 
aagaggtggt 
tctcccgctg 
gtggcggcaa 
ccatgtccca 
ttcctacaac 
atgagaatga 
agagaatgtc 
ctaaagctga 
aggaagcagc 
tgctcaatga 
ctcctcggcc 
acagacagca 
ctcagatccg 
ctctctccct 
agctgcttca 
caaggatcag 
tggagctcct 
ttggggctga 
ctgctgccga 
aggagatctc 
atcatcaaaa 
gactcatggt 
agaagaaaca 
cagaggagcg 
tctttgagca 



gctcctgctg 
cctgctggct 
ccagaatggg 
aggcatcctg 
agccaaccaa 
ccatccccac 
tctcgttcct 
tcttcactgg 
ctacggcatg 
cccactggct 
tgtctggtgg 
agtagcagag 
cgatgaggat 
gagaaccacc 
tcgagaggtg 
gtactttgat 
ccggaacaac 
aagtttactc 
agcagccagt 
acatgcccat 
ccaggtcatg 
taagaaggca 
caacgagaga 
ccgccgccgc 
tcgtcacgtg 
caccctaaag 
gtcccaggtt 
gctctacaac 
gaaagagcaa 
ttacggaaac 
tcccgtgaat 
ctctgtgcca 
ccgaggactg 
tgaagtgaag 
attggtgttc 
gggcggtgtt 
gtacacatcc 
ccacctgtcc 
gatgcagaac 



gccgcctgga 
gaaccccaga 
aagtgggat t 
cagtattgcc 
ccagtgacca 
tttgtgattc 
gacaagtgca 
cacaccgtcg 
ttgctgccct 
gaagaaagtg 
ggcggagcag 
gaggaagaag 
ggtgatgagg 
agcattgcca 
tgctctgaac 
gtgactgaag 
tttgacacag 
aagactaccc 
acccctgatg 
ttccagaaag 
agagaatggg 
gttatccagc 
cagcagctgg 
ctggccctgg 
ttcaatatgc 
catttcgagc 
atgacacacc 
gtgcctgcag 
aactattcag 
gatgctctca 
ggagagttca 
gccaacacag 
accactcgac 
atggatgcag 
tttgcagaag 
gtcatagcga 
attcatcatg 
aagatgcagc 
aagaag 



cggctcgggc 
ttgccatgtt 
cagatccatc 
aagaagtcta 
tccagaactg 
cctaccgctg 
aattcttaca 
ccaaagagac 
gcggaattga 
acaatgtgga 
acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
aagccgagac 
ggaagtgtgc 
aagagtactg 
aggaacctct 
ccgttgacaa 
ccaaagagag 
aagaggcaga 
atttccagga 
tggagacaca 
agaactacat 
taaagaagta 
atgtgcgcat 
tccgtgtgat 
tggccgagga 
atgacgtctt 
tgccatcttt 
gcctggacga 
aaaacgaagt 
caggttctgg 
aattccgaca 
atgtgggttc 
cagtgatcgt 
gtgtggtgga 
agaacggcta 



gctggaggta 
ctgtggcaga 
agggaccaaa 
ccctgaactg 
gtgcaagcgg 
cttagttggt 
ccaggagagg 
atgcagtgag 
caagttccga 
ttctgctgat 
tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
ggggccgtgc 
cccattcttt 
catggccgtg 
tggccgagat 
gtatctcgag 
gcttgaggcc 
acgtcaagca 
gaaagtggaa 
catggccaga 
caccgctctg 
tgtccgcgca 
ggtggatccc 
ttatgagcgc 
gattcaggat 
ggccaacatg 
gaccgaaacg 
tctccagccg 
tgagcctgtt 
gttgacaaat 
tgactcagga 
aaacaaaggt 
catcaccttg 
ggttgacgcc 
cgaaaatcca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

3260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2316 



<210> 59 
<211> 772 
<212> PRT 

<213> Homo sapiens 
<400> 59 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala 
15 10 



Ala Trp Thr Ala Arg 
15 



Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 



Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 
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Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys lie Asp 
50 55 60 

Thr Lys Glu Gly lie Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin lie Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr lie Gin Asn 
85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 110 

lie Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly lie 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 " 215 ~ 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg Ala Met lie 
290 295 300 

Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys Ala Pro Phe Phe 
305 310 315 320 

Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Asp Thr Glu Glu Tyr 
325 330 335 

Cys Met Ala Val Cys Gly Ser Ala Met Ser Gin Ser Leu Leu Lys Thr 
340 345 350 

Thr Gin Glu Pro Leu Ala Arg Asp Pro Val Lys Leu Pro Thr Thr Ala 
355 360 365 

Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu Glu Thr Pro Gly Asp 
370 375 380 



WO 01/23533 



PCT/US00/26080 



-55- 

Glu Asn Glu His Ala His Phe Gin Lys Ala Lys Glu Arg Leu Glu Ala 
385 390 395 400 

Lys His Arg Glu Arg Met Ser Gin Val Met Arg Glu Trp Glu Glu Ala 
405 410 415 

Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp Lys Lys Ala Val lie 
420 425 430 

Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu Gin Glu Ala Ala Asn 
435 440 445 

Glu Arg Gin Gin Leu Val Glu Thr His Met Ala Arg Val Glu Ala Met 
450 455 460 



Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn Tyr lie Thr Ala Leu 
465 470 475 480 

Gin Ala Val Pro Pro Arg Pro Arg His Val Phe Asn Met Leu Lys Lys 
485 ~ 490 495 

Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His Thr Leu Lys His Phe 
500 505 510 

Glu His Val Arg Met Val Asp Pro Lys Lys Ala Ala Gin lie Arg Ser 
515 520 525 

Gin Val Met Thr His Leu Arg Val He Tyr Glu Arg Met Asn Gin Ser 
530 535 540 

Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala Glu Glu He Gin Asp 
545 550 555 560 

Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn Tyr Ser Asp Asp Val 
565 570 575 

Leu Ala Asn Met He Ser Glu Pro Arg He Ser Tyr Gly Asn Asp Ala 
580 585 590 

Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr Val Glu Leu Leu Pro 
595 600 605 

Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin Pro Trp His Ser Phe 
610 615 620 

Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn Glu Val Glu Pro Val 
625 630 635 640 

Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr Thr Arg Pro Gly Ser 
645 650 655 

Gly Leu Thr Asn He Lys Thr Glu Glu He Ser Glu Val Lys Met Asp 
660 665 670 

Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val His His Gin Lys Leu 
675 680 685 

Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys Gly Ala He He Gly 
690 695 700 

Leu Met Val Gly Gly Val Val He Ala Thr Val He Val He Thr Leu 
705 710 715 720 
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Val Met Leu Lys 

Glu Val Asp Ala 
740 



Lys Lys Gin Tyr 
725 

Ala Val Thr Pro 



-56- 

Thr Ser lie His 
730 

Glu Glu Arg His 
745 



His Gly Val Val 
735 

Leu Ser Lys Met 
750 



Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gin Met 
755 760 765 

Gin Asn Lys Lys 
770 



<210> 60 

<211> 2259 

<212> DNA 

<213> Homo sapiens 



<400> 60 

atgctgcccg 

cccactgatg 

ctgaacatgc 

acctgcattg 

cagatcacca 

ggccgcaagc 

gagtttgtaa 

atggatgttt 

aagagtacca 

ggggtagagt 

gcggaggagg 

agtgaagaca 

gaagccgatg 

ccctacgaag 

gagtctgtgg 

cgagcaatga 

tacggcggat 

tgtggcagcg 

gagacacctg 

gccaagcacc 

gcaaagaact 

gaatctttgg 

agagtggaag 

ctgcaggctg 

gcagaacaga 

cccaagaaag 

cgcatgaatc 

gatgaagttg 

atgattagtg 

acgaaaacca 

ccgtggcat t 

gttgatgccc 

aatatcaaga 

ggatatgaag 

ggtgcaatca 

ttggtgatgc 

gccgctgtca 

ccaacctaca 



gtttggcact 
gtaatgctgg 
acatgaatgt 
ataccaagga 
atgtggtaga 
agtgcaagac 
gtgatgccct 
gcgaaactca 
acttgcatga 
ttgtgtgttg 
atgactcgga 
aagtagtaga 
atgacgagga 
aagccacaga 
aagaggtggt 
tctcccgctg 
gtggcggcaa 
ccattcctac 
gggatgagaa 
gagagagaat 
tgcctaaagc 
aacaggaagc 
ccatgctcaa 
ttcctcctcg 
aggacagaca 
ccgctcagat 
agtctctctc 
atgagctgct 
aaccaaggat 
ccgtggagct 
ct tttggggc 
gccctgctgc 
cggaggagat 
t tcatcatca 
ttggactcat 
tgaagaagaa 
ccccagagga 
agttctttga 



gctcctgctg 
cctgctggct 
ccagaatggg 
aggcatcctg 
agccaaccaa 
ccatccccac 
tctcgttcct 
tcttcactgg 
ctacggcatg 
cccactggct 
tgtctggtgg 
agtagcagag 
cgatgaggat 
gagaaccacc 
tcgagaggtg 
gtactttgat 
ccggaacaac 
aacagcagcc 
tgaacatgcc 
gtcccaggtc 
tgataagaag 
agccaacgag 
tgaccgccgc 
gcctcgtcac 
gcacacccta 
ccggtcccag 
cctgctctac 
tcagaaagag 
cagttacgga 
ccttcccgtg 
tgactctgtg 
cgaccgagga 
ctctgaagtg 
aaaattggtg 
ggtgggcggt 
acagtacaca 
gcgccacctg 
gcagatgcag 



gccgcctgga 
gaaccccaga 
aagtgggatt 
cagtattgcc 
ccagtgacca 
tttgtgattc 
gacaagtgca 
cacaccgtcg 
ttgctgccct 
gaagaaagtg 
ggcggagcag 
gaggaagaag 
ggtgatgagg 
agcattgcca 
tgctctgaac 
gtgactgaag 
tttgacacag 
agtacccctg 
catttccaga 
atgagagaat 
gcagttatcc 
agacagcagc 
cgcctggccc 
gtgttcaata 
aagcatttcg 
gttatgacac 
aacgtgcctg 
caaaactatt 
aacgatgctc 
aatggagagt 
ccagccaaca 
ctgaccactc 
aagatggatg 
ttctttgcag 
gttgtcatag 
tccattcatc 
tccaagatgc 
aacaagaag 



cggctcgggc 
ttgccatgtt 
cagatccatc 
aagaagtcta 
tccagaactg 
cctaccgctg 
aattcttaca 
ccaaagagac 
gcggaattga 
acaatgtgga 
acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
aagccgagac 
ggaagtgtgc 
aagagtactg 
atgccgttga 
aagccaaaga 
gggaagaggc 
agcatttcca 
tggtggagac 
tggagaacta 
tgctaaagaa 
agcatgtgcg 
acctccgtgt 
cagtggccga 
cagatgacgt 
tcatgccatc 
tcagcctgga 
cagaaaacga 
gaccaggttc 
cagaattccg 
aagatgtggg 
cgacagtgat 
atggtgtggt 
agcagaacgg 



gctggaggta 
ctgtggcaga 
agggaccaaa 
ccctgaactg 
gtgcaagcgg 
cttagttggt 
ccaggagagg 
atgcagtgag 
caagttccga 
ttctgctgat 
tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
ggggccgtgc 
cccattcttt 
catggccgtg 
caagtatctc 
gaggcttgag 
agaacgtcaa 
ggagaaagtg 
acacatggcc 
catcaccgct 
gtatgtccgc 
catggtggat 
gatttatgag 
ggagattcag 
cttggccaac 
tttgaccgaa 
cgatctccag 
agttgagcct 
tgggttgaca 
acatgactca 
ttcaaacaaa 
cgtcatcacc 
ggaggttgac 
ctacgaaaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2259 



<210> 61 

<211> 753 

<212> PRT 

<213> Homo sapiens 
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<400> 61 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys lie Asp 
50 55 60 

Thr Lys Glu Gly lie Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin lie Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr lie Gin Asn 
85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 110 

lie Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 ' 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly lie 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
. 210 215 ~ 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 * 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 265 270 



Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 

275 280 285 

Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg Ala Met lie 

290 295 300 

Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys Ala Pro Phe Phe 

305 310 315 320 
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Tyr Gly Gly Cys 



Cys Met Ala Val 
340 

Pro Asp Ala Val 
355 

His Ala His Phe 
370 

Glu Arg Met Ser 
385 

Ala Lys Asn Leu 



Gin Glu Lys Val 
420 

Gin Leu Val Glu 
435 

Arg Arg Arg Leu 
450 

Pro Pro Arg Pro 
4 65 

Ala Glu Gin Lys 



Arg Met Val Asp 
500 

Thr His Leu Arg 
515 

Leu Tyr Asn Val 
530 

Glu Leu Leu Gin 
545 



Met lie Ser Glu 



Ser Leu Thr Glu 
580 

Glu Phe Ser Leu 
595 

Ser Val Pro Ala 
610 

Pro Ala Ala Asp 
62 5 



Gly Gly Asn Arg 
325 

Cys Gly Ser Ala 



Asp Lys Tyr Leu 
360 

Gin Lys Ala Lys 
375 

Gin Val Met Arg 
390 

Pro Lys Ala Asp 
405 

Glu Ser Leu Glu 



Thr His Met Ala 
440 

Ala Leu Glu Asn 
455 

Arg His Val Phe 
470 

Asp Arg Gin His 
485 

Pro Lys Lys Ala 



Val lie Tyr Glu 
520 

Pro Ala Val Ala 
535 

Lys Glu Gin Asn 
550 



Pro Arg lie Ser 
565 

Thr Lys Thr Thr 



Asp Asp Leu Gin 
600 

Asn Thr Glu Asn 
615 

Arg Gly Leu Thr 
630 



-58- 

Asn Asn Phe Asp 
330 

lie Pro Thr Thr 
345 

Glu Thr Pro Gly 



Glu Arg Leu Glu 
380 

Glu Trp Glu Glu 
395 

Lys Lys Ala Val 
410 

Gin Glu Ala Ala 
425 

Arg Val Glu Ala 



Tyr lie Thr Ala 
460 

Asn Met Leu Lys 
475 

Thr Leu Lys His 
490 

Ala Gin lie Arg 
505 

Arg Met Asn Gin 



Glu Glu lie Gin 
540 

Tyr Ser Asp Asp 
555 



Tyr Gly Asn Asp 
570 

Val Glu Leu Leu 
585 

Pro Trp His Ser 



Glu Val Glu Pro 
620 

Thr Arg Pro Gly 
635 



Thr Glu Glu Tyr 
335 

Ala Ala Ser Thr 
350 

Asp Glu Asn Glu 
365 

Ala Lys His Arg 



Ala Glu Arg Gin 
400 

lie Gin His Phe 
415 

Asn Glu Arg Gin 
430 

Met Leu Asn Asp 
445 

Leu Gin Ala Val 



Lys Tyr Val Arg 
4 8 0 

Phe Glu His Val 
495 

Ser Gin Val Met 
510 

Ser Leu Ser Leu 
525 

Asp Glu Val Asp 



Val Leu Ala Asn 
560 



Ala Leu Met Pro 
575 

Pro Val Asn Gly 
590 

Phe Gly Ala Asp 
605 

Val Asp Ala Arg 



Ser Gly Leu Thr 
640 



Asn lie Lys Thr Glu Glu lie Ser Glu Val Lys Met Asp Ala Glu Phe 
645 650 * 655 
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Arg His Asp Ser Gly 
660 

Ala Glu Asp Val Gly 
675 

Gly Gly Val Val He 
690 

Lys Lys Lys Gin Tyr 
705 

Ala Ala Val Thr Pro 
725 

Gly Tyr Glu Asn Pro 
740 

Lys 



-59- 

Tyr Glu Val His His Gin 
665 

Ser Asn Lys Gly Ala He 
680 

Ala Thr Val He Val He 
695 

Thr Ser He His His Gly 
710 715 

Glu Glu Arg His Leu Ser 
730 

Thr Tyr Lys Phe Phe Glu 
745 



Lys Leu Val Phe Phe 
670 

lie Gly Leu Met Val 
685 

Thr Leu Val Met Leu 
700 

Val Val Glu Val Asp 
720 

Lys Met Gin Gin Asn 
735 

Gin Met Gin Asn Lys 
750 



<210> 62 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
<400> 62 

Leu Glu Val Leu Phe Gin Gly Pro 
1 5 



<210> 63 

<211> 10 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 

<400> 63 

Ser Glu Val Asn Leu Asp Ala Glu Phe Arg 
1 5 10 



<210> 64 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
<400> 64 

Ser Glu Val Lys Met Asp Ala Glu Phe Arg 
15 10 



<210> 65 
<211> 15 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
<400> 65 

Arg Arg Gly Gly Val Val lie Ala Thr Val lie Val Gly Glu Arg 
15 10 15 



<210> 66 
<211> 518 
<212> PRT 

<213> Homo sapiens 
<400> 66 

Met Gly Ala Leu Ala Arg Ala Leu Leu Leu Pro Leu Leu Ala Gin Trp 
1 5 10 15 

Leu Leu Arg Ala Ala Pro Glu Leu Ala Pro Ala Pro Phe Thr Leu Pro 
20 25 30 

Leu Arg Val Ala Ala Ala Thr Asn Arg Val Val Ala Pro Thr Pro Gly 
35 40 45 

Pro Gly Thr Pro Ala Glu Arg His Ala Asp Gly Leu Ala Leu Ala Leu 
50 55 60 

Glu Pro Ala Leu Ala Ser Pro Ala Gly Ala Ala Asn Phe Leu Ala Met 
65 70 75 60 

Val Asp Asn Leu Gin Gly Asp Ser Gly Arg Gly Tyr Tyr Leu Glu Met 
85 90 95 

Leu He Gly Thr Pro Pro Gin Lys Leu Gin He Leu Val Asp Thr Gly 
100 105 HO 

Ser Ser Asn Phe Ala Val Ala Gly Thr Pro His Ser Tyr He Asp Thr 
115 120 125 

Tyr Phe Asp Thr Glu Arg Ser Ser Thr Tyr Arg Ser Lys Gly Phe Asp 
130 135 140 

Val Thr Val Lys Tyr Thr Gin Gly Ser Trp Thr Gly Phe Val Gly Glu 
145 150 155 160 

Asp Leu Val Thr He Pro Lys Gly Phe Asn Thr Ser Phe Leu Val Asn 
165 170 175 

He Ala Thr He Phe Glu Ser Glu Asn Phe Phe Leu Pro Gly He Lys 
180 185 190 

Trp Asn Gly He Leu Gly Leu Ala Tyr Ala Thr Leu Ala Lys Pro Ser 
195 200 205 

Ser Ser Leu Glu Thr Phe Phe Asp Ser Leu Val Thr Gin Ala Asn He 
210 215 220 

Pro Asn Val Phe Ser Met Gin Met Cys Gly Ala Gly Leu Pro Val Ala 
225 230 235 240 
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Gly Ser Gly Thr Asn Gly Gly Ser Leu Val Leu Gly Gly lie Glu Pro 
245 250 255 

Ser Leu Tyr Lys Gly Asp lie Trp Tyr Thr Pro lie Lys Glu Glu Trp 
260 265 270 

Tyr Tyr Gin lie Glu lie Leu Lys Leu Glu lie Gly Gly Gin Ser Leu 
275 280 285 

Asn Leu Asp Cys Arg Glu Tyr Asn Ala Asp Lys Ala He Val Asp Ser 
290 295 300 

Gly Thr Thr Leu Leu Arg Leu Pro Gin Lys Val Phe Asp Ala Val Val 
305 310 315 320 

Glu Ala Val Ala Arg Ala Ser Leu He Pro Glu Phe Ser Asp Gly Phe 
325 330 335 

Trp Thr Gly Ser Gin Leu Ala Cys Trp Thr Asn Ser Glu Thr Pro Trp 
340 345 350 

Ser Tyr Phe Pro Lys He Ser He Tyr Leu Arg Asp Glu Asn Ser Ser 
355 360 365 

Arg Ser Phe Arg He Thr He Leu Pro Gin Leu Tyr He Gin Pro Met 
370 375 380 

Met Gly Ala Gly Leu Asn Tyr Glu Cys Tyr Arg Phe Gly He Ser Pro 
385 390 395 400 

Ser Thr Asn Ala Leu Val He Gly Ala Thr Val Met Glu Gly Phe Tyr 
405 410 415 

Val He Phe Asp Arg Ala Gin Lys Arg Val Gly Phe Ala Ala Ser Pro 
420 425 430 

Cys Ala Glu He Ala Gly Ala Ala Val Ser Glu He Ser Gly Pro Phe 
435 440 445 

Ser Thr Glu Asp Val Ala Ser Asn Cys Val Pro Ala Gin Ser Leu Ser 
450 455 460 

Glu Pro He Leu Trp He Val Ser Tyr Ala Leu Met Ser Val Cys Gly 
465 470 475 480 

Ala lie Leu Leu Val Leu He Val Leu Leu Leu Leu Pro Phe Arg Cys 
485 490 495 

Gin Arg Arg Pro Arg Asp Pro Glu Val Val Asn Asp Glu Ser Ser Leu 
500 505 510 

Val Arg His Arg Trp Lys 
515 



<210> 67 

<211> 475 

<212> PRT 

<213> Homo sapiens 



<400> 67 

Met Gly Ala Leu Ala Arg Ala Leu Leu Leu Pro Leu Leu Ala Gin Trp 
1 5 ~ 10 15 
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Leu Leu Arg Ala Ala Pro Glu Leu Ala Pro Ala Pro Phe Thr Leu Pro 
20 25 30 

Leu Arg Val Ala Ala Ala Thr Asn Arg Val Val Ala Pro Thr Pro Gly 
35 40 45 

Pro Gly Thr Pro Ala Glu Arg His Ala Asp Gly Leu Ala Leu Ala Leu 
50 55 60 

Glu Pro Ala Leu Ala Ser Pro Ala Gly Ala Ala Asn Phe Leu Ala Met 
65 70 75 80 

Val Asp Asn Leu Gin Gly Asp Ser Gly Arg Gly Tyr Tyr Leu Glu Met 
85 90 95 

Leu lie Gly Thr Pro Pro Gin Lys Leu Gin lie Leu Val Asp Thr Gly 
100 105 110 

Ser Ser Asn Phe Ala Val Ala Gly Thr Pro His Ser Tyr lie Asp Thr 
115 120 125 

Tyr Phe Asp Thr Glu Arg Ser Ser Thr Tyr Arg Ser Lys Gly Phe Asp 
130 135 140 

Val Thr Val Lys Tyr Thr Gin Gly Ser Trp Thr Gly Phe Val Gly Glu 
145 150 155 160 

Asp Leu Val Thr lie Pro Lys Gly Phe Asn Thr Ser Phe Leu Val Asn 
165 170 175 

lie Ala Thr lie Phe Glu Ser Glu Asn Phe Phe Leu Pro Gly lie Lys 
180 185 190 

Trp Asn Gly lie Leu Gly Leu Ala Tyr Ala Thr Leu Ala Lys Pro Ser 
195 ~ 200 ~ 205 

Ser Ser Leu Glu Thr Phe Phe Asp Ser Leu Val Thr Gin Ala Asn lie 
210 215 220 

Pro Asn Val Phe Ser Met Gin Met Cys Gly Ala Gly Leu Pro Val Ala 
225 230 235 240 

Gly Ser Gly Thr Asn Gly Gly Ser Leu Val Leu Gly Gly lie Glu Pro 
245 250 255 

Ser Leu Tyr Lys Gly Asp lie Trp Tyr Thr Pro lie Lys Glu Glu Trp 
260 265 270 

Tyr Tyr Gin lie Glu lie Leu Lys Leu Glu He Gly Gly Gin Ser Leu 
275 280 285 

Asn Leu Asp Cys Arg Glu Tyr Asn Ala Asp Lys Ala He Val Asp Ser 
290 295 300 

Gly Thr Thr Leu Leu Arg Leu Pro Gin Lys Val Phe Asp Ala Val Val 
305 310 315 320 

Glu Ala Val Ala Arg Ala Ser Leu He Pro Glu Phe Ser Asp Gly Phe 
325 330 335 

Trp Thr Gly Ser Gin Leu Ala Cys Trp Thr Asn Ser Glu Thr Pro Trp 
340 345 350 
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Ser Tyr Phe Pro 
355 

Arg Ser Phe Arg 
370 

Met Gly Ala Gly 
385 

Ser Thr Asn Ala 



Val lie Phe Asp 
420 

Cys Ala Glu lie 
435 

Ser Thr Glu Asp 
450 

Glu Pro lie Leu 
465 



Lys lie Ser lie 
360 

lie Thr lie Leu 
375 

Leu Asn Tyr Glu 
390 

Leu Val lie Gly 
405 

Arg Ala Gin Lys 



Ala Gly Ala Ala 
440 

Val Ala Ser Asn 
455 

Trp His His His 
470 



-63- 



Tyr Leu Arg Asp 



Pro Gin Leu Tyr 
380 

Cys Tyr Arg Phe 
395 

Ala Thr Val Met 
410 

Arg Val Gly Phe 
425 

Val Ser Glu lie 



Cys Val Pro Ala 
460 

His His His 
475 



Glu Asn Ser Ser 
365 

He Gin Pro Met 



Gly He Ser Pro 
400 

Glu Gly Phe Tyr 
415 

Ala Ala Ser Pro 
430 

Ser Gly Pro Phe 
445 

Gin Ser Leu Ser 



<210> 68 
<211> 413 
<212> PRT 

<213> Homo sapiens 
<400> 68 

Ala Leu Glu Pro Ala Leu Ala Ser Pro Ala Gly Ala Ala Asn Phe Leu 
1^5 10 15 

Ala Met Val Asp Asn Leu Gin Gly Asp Ser Gly Arg Gly Tyr Tyr Leu 
20 25 - 3q 

Glu Met Leu He Gly Thr Pro Pro Gin Lys Leu Gin He Leu Val Asp 
35 40 45 

Thr, Gly Ser Ser Asn Phe Ala Val Ala Gly Thr Pro His Ser Tyr He 
50 55 60 

Asp Thr Tyr Phe Asp Thr Glu Arg Ser Ser Thr Tyr Arg Ser Lys Gly 
65 ~ 70 75 80 

Phe Asp Val Thr Val Lys Tyr Thr Gin Gly Ser Trp Thr Gly Phe Val 
85 90 95 

Gly Glu Asp Leu Val Thr He Pro Lys Gly Phe Asn Thr Ser Phe Leu 
100 105 110 

Val Asn He Ala Thr lie Phe Glu Ser Glu Asn Phe Phe Leu Pro Gly 
115 120 125 

He Lys Trp Asn Gly He Leu Gly Leu Ala Tyr Ala Thr Leu Ala Lys 
130 135 ' 140 

Pro Ser Ser Ser Leu Glu Thr Phe Phe Asp Ser Leu Val Thr Gin Ala 
145 150 * 155 160 



Asn He Pro Asn Val Phe Ser Met Gin Met Cys Gly Ala Gly Leu Pro 
165 170 175 
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Val Ala Gly Ser Gly Thr Asn Gly Gly Ser Leu Val Leu Gly Gly lie 
180 185 190 

Glu Pro Ser Leu Tyr Lys Gly Asp lie Trp Tyr Thr Pro lie Lys Glu 
195 200 205 

Glu Trp Tyr Tyr Gin lie Glu lie Leu Lys Leu Glu lie Gly Gly Gin 
210 215 220 

Ser Leu Asn Leu Asp Cys Arg Glu Tyr Asn Ala Asp Lys Ala lie Val 
225 230 235 240 

Asp Ser Gly Thr Thr Leu Leu Arg Leu Pro Gin Lys Val Phe Asp Ala 
245 250 255 

val val Glu Ala Val Ala Arg Ala Ser Leu lie Pro Glu Phe Ser Asp 
260 265 270 

Gly Phe Trp Thr Gly Ser Gin Leu Ala Cys Trp Thr Asn Ser Glu Thr 
275 280 285 

Pro Trp Ser Tyr Phe Pro Lys lie Ser lie Tyr Leu Arg Asp Glu Asn 
290 295 300 

Ser Ser Arg Ser Phe Arg He Thr He Leu Pro Gin Leu Tyr He Gin 
305 310 315 320 

Pro Met Met Gly Ala Gly Leu Asn Tyr Glu Cys Tyr Arg Phe Gly He 
325 330 335 

Ser Pro Ser Thr Asn Ala Leu Val He Gly Ala Thr Val Met Glu Gly 
340 345 350 

Phe Tyr Val He Phe Asp Arg Ala Gin Lys Arg Val Gly Phe Ala Ala 
355 360 365 

Ser Pro Cys Ala Glu He Ala Gly Ala Ala Val Ser Glu He Ser Gly 
370 375 380 

Pro Phe Ser Thr Glu Asp Val Ala Ser Asn Cys Val Pro Ala Gin Ser 
385 390 395 400 

Leu Ser Glu Pro He Leu Trp His His His His His His 
405 410 



<210> 69 

<211> 8 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 

<400> 69 

Gly Leu Ala Leu Ala Leu Glu Pro 
1 5 



<210> 70 

<211> 8 

<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Peptide 
<400> 70 

Glu Val Lys Met Asp Ala Glu Phe 
1 5 

<210> 71 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 
<400> 71 

Glu Val Asn Leu Asp Ala Glu Phe 
1 5 



<210> 72 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 
<400> 72 

Leu Val Phe Phe Ala Glu Asp Val 
1 5 



<210> 73 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 
<400> 73 

Lys Leu Val Phe Phe Ala Glu Asp 
1 5 

<210> 74 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 74 

cgctttaagc ttgccaccat gggcgcactg gcccgggcg 39 



<210> 75 

<211> 57 

<212> DNA 

<213> Artificial 



Sequence 



<220> 
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<223> Description of Artificial Sequence: Primer 
<400> 75 

cgctttctcg agctaatggt gatggtgatg gtgccacaaa atgggctcgc tcaaaga 57 



<210> 76 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 
<400> 76 

Arg Arg Gly Gly Val Val lie Ala Thr Val He Val Gly Glu Arg 
1 5 10 15 
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